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Phase  I  of  this  project  represented  an  attempt  to  obtain  a  picture  of  the 
effects  of  cultural  factors  on  existing  selection  procedures  using  psychological 
tests.  In  examining  765  predictor-criterion  relationships  219  cases  were  found  of 
unfairness  to  a  racial  subgroup  had  these  tests  been  used  for  selection. 

Phase  II  represented  an  attempt  to  obtain  a  picture  of  the  effects  of  cultural 
factors  on  less  traditional  measures.  Eight  studies  were  carried  out  including 
as  predictors  work  sample  tests,  simulators,  motor  skill  tests,  culture-free 
tests,  biographical  information,  and  learning  measures.  Altogether,  kOO  additional 
predictor-criterion  relationships  were  examined  including  272  with  these  less 
traditional  predictors. 

The  use  of  less  traditional  predictors  did  not  result  in  an  improve¬ 
ment  in  validity  or  in  a  reduction  of  unfairness.  The  most  frequent  occurrence 
was  a  lack  of  validity  for  either  ethnic  group,  with  the  finding  of  validity  com¬ 
bined  with  no  unfairness  being  exceedingly  rare.  Although  some  of  the  less 
traditional  showed  promise  it  was  concluded  that  situation- specific  validation 
with  all  predictors  appears  essential  in  order  to  avoid  possible  inadvertent 
bias  to  minority  groups  through  unfair  selection  measures. 
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Introduction 


Phase  I  of  this  project  (O'Leary,  Parr,  and  Bartlett,  1970)  represented  an 
attempt  to  obtain  a  picture  of  the  effects  of  cultural  factors  on  existing 
selection  procedures  using  psychological  tests.  Seven  independent  studies  were 
conducted  in  which  the  validity  of  commercial  and  industrially  developed  selec¬ 
tion  tests  examined  separately  for  white  and  Negro  subgroups.  Examining  mean 
differences  in  predictor  and  criterion  measures  along  with  the  separate  validity 
coefficients  765  predictor-criterion  relationships  were  then  classified  accord¬ 
ing  to  the  differential  prediction  models  discussed  by  Bartlett  and  O'Leary 
(1969).  Of  the  765  relationships  examined  357  Bart lett -O ' Leary  models  were 
described  with  219  cases  of  unfairness  to  a  racial  subgroup  had  these  tests  been 
used  for  selection. 

Phase  II  of  this  project  represents  an  attempt  to  obtain  a  picture  of  the 
effects  of  cultural  factors  on  less  traditional  measures  which  might  be  utilized 
in  selection.  Due  to  the  extent  to  which  traditional  psychological  tests  might 
lead  to  inadvertent  racial  discrimination  it  is  hoped  that  less  traditional  meas¬ 
ures  stay  show  fewer  Instances  of  unfairness  when  used  for  personnel  selection. 

The  purpose  of  the  second  phase  of  this  project  is  to  examine  a  number  of  these 
less  traditional  measures  and  evaluate  them  with  regard  to  their  fairness  in 
selection  from  racially  mixed  groups. 

Study  #1  was  carried  out  on  a  racially  mixed  group  of  taxicab  drivers. 
Driver  training  simulators  and  a  variety  of  psychomotor  tests  were  used  as  pre¬ 
dictors.  Criteria  included  observation  of  street  errors,  accident  record,  and 
traffic  violation  record. 

Study  #2  was  carried  out  on  a  group  of  high  school  students  with  several 
biographical  inventory  keys  utilized  as  predictors  along  with  a  traditional 
ability  test.  Grades  in  school  and  teacher  ratings  were  used  as  criteria. 
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Study  #3  examined  the  prediction  of  college  grades  using  an  opinion  ques¬ 
tionnaire  (University  Student  Census)  as  a  predictor  as  veil  as  personality  and 
interest  inventory  measures  in  order  to  determine  if  these  measures  could  improve 
the  standard  academic  predictions* 

Studies  #4  and  #5  examined  workers  in  a  health  Insurance  company.  The  SRA 
Pictorial  Reaaonlrg  Test  which  has  been  alleged  to  be  culture-free  vas  compared 
to  more  traditional  ability  measures  in  the  prediction  of  supervisor  ratings. 

In  these  studies  a  measure  of  cultural  deprivation  suggested  by  Guion  (1966)  vas 
employed.  This  measure  utilizes  the  discrepancy  between  the  scores  on  the 
culture- bound  test  and  a  culture-free  test  as  either  a  moderator  or  a  predictor. 

Study  #6  compared  Fundamental  Achlsvaasxt  Series,  another  measure  regarded 
as  less  culturally  biased,  with  traditional  ability  measures  In  the  prediction  of 
success  in  Marine  Corps  training.  As  in  the  two  previous  studies  the  cultural 
deprivation  index  suggested  by  Guion  vas  employed. 

Study  #7  examined  a  work  sample  testing  procedure  along  with  psychomotor 
tests  in  the  prediction  of  turnover  and  training  criteria  for  sewing  machine 
workers.  The  work  sample  test  provided  a  career  preview  of  the  Job,  serving  as 
a  Job  expectancy  training  program  as  veil  as  a  selection  instrument.  Thus  aelf- 
selectlon  as  well  as  administrative  selection  vas  investigated. 

Study  #6  examined  experimental  learning  tasks  as  predictors  of  programmed 
learning  performance  using  a  pre-post  design.  Three  different  learning  tasks  were 
utilized:  a  paired  associate  task;  a  concept  learning  task;  and  a  principle  learn¬ 
ing  task.  The  criterion  consisted  of  learning  performance  on  a  programmed  instruc¬ 
tional  unit  as  measured  by  gain  in  proficiency  on  a  multiple  choice  test. 

All  of  these  less  traditional  predictions  were  then  examined  in  terns  of  the 
differential  prediction  models  described  by  Bartlett  and  O'Leary,  and  frequency 
of  selection  bias  vas  compared  with  more  traditional  predictors  typically  used 
in  personnel  selection. 
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The  Bartlett  -0  *  Leary  Models 


All  of  the  relationships  are  classified  according  to  the  differential  pre¬ 
diction  models  which  have  been  described  in  detail  by  Bartlett  and  O'Leary  (1969)* 
Scatter  diagram  representations  of  these  eleven  models  are  presented  in  Figures 
1  through  11.  The  models  are  summarized  briefly  as  follows: 

Model  1  -  The  predictor  is  valid  for  both  subgroups,  and  significant  mean  dif¬ 
ferences  occur  on  both  the  predictor  and  the  criterion. 

Model  2  -  The  predictor  is  valid  for  both  subgroups,  and  significant  mean  dif¬ 
ferences  occur  on  the  predictor  but  not  the  criterion. 

Model  3  -  The  predictor  is  valid  for  both  subgroups,  and  significant  mean  dif¬ 
ferences  occur  on  the  criterion  but  not  the  predictor. 

Model  U  -  The  predictor  is  valid  for  both  subgroups,  and  significant  mean  dif¬ 
ferences  occur  on  the  predictor  and  the  criterion,  but  in  opposite  directions. 

This  model  differs  from  the  first  one  in  that  the  Intercepts  of  the  two  regres¬ 
sion  lines  are  different  for  this  one. 

Model  5  -  The  predictor  is  valid  for  one  subgroup  but  not  the  other,  and  there 
are  no  significant  mean  differences  on  either  the  predictor  or  the  criterion. 

Model  6  -  The  predictor  is  valid  for  one  subgroup  but  not  the  other,  and  there 

are  significant  mean  differences  on  the  criterion  but  not  the  predictor. 

Model  7  -  The  predictor  is  valid  for  one  subgroup  but  not  the  other,  and  there 

are  significant  mean  differences  on  the  predictor  but  not  the  criterion. 

Model  8  -  The  predictor  is  valid  for  one  subgroup  but  not  the  other,  and  there 

are  significant  mean  differences  on  both  the  predictor  and  the  criterion. 

Model  9  -  The  predictor  is  valid  for  both  subgroups,  but  in  opposite  directions. 
There  are  no  mean  differences  on  either  the  predictor  or  the  criterion. 

Model  10  -  The  predictor  is  valid  for  both  subgroups,  but  in  opposite  directions. 
There  are  significant  differences  on  the  predictor  but  not  the  criterion. 

Model  11  -  There  is  no  validity  in  either  subgroup,  but  there  are  significant 
differences  on  both  the  predictor  and  the  criterion. 
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SCATTER  DIAGRAM  REPRESENTATIONS 
OF  THE  BARTLETT  -  O'LEARY  MODELS 
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Study  #1:  Taxicab  Drivers 


The  Relationship  between  Street  Driving  and  Simulator  and 
Perceptual-Motor  Ability  Test  Performance 


Sample 

This  investigation  represents  a  reanalysis  of  data  reported  by  Edvards, 
Hahn  and  Fleishman  (19&9)  the  American  Institutes  for  Research.  The 
present  study  reports  data  analyses  of  the  total  group  and  the  two  racial 
subgroups.  Complete  details  of  the  original  study  are  found  in  Edvards, 
et  al.  The  taxicab  driver  sample  consisted  of  303  workers  of  whom  152  were 
white  and  151  were  Negro.  All  of  the  drivers  were  employed  in  Washington, 

D.C.  and  generally  drove  in  city  traffic.  As  can  be  seen  in  Table  1,  the 
white  subgroup  was  older  than  the  Negro  subgroup  and,  consequently,  also  had 
more  years  of  driving  experience  than  the  Negro  subgroup.  The  Negro  taxicab 
drivers  had  attained  a  significantly  higher  level  of  education  than  the  white 
subgroup.  The  two  subgroups  had  approximately  equal  annual  driving  mileage. 
All  subjects  were  paid  for  their  participation  in  the  study. 

Predictors 

For  the  purposes  of  the  present  data  analyses,  a  number  of  variables  were 
investigated  as  possible  predictors  of  driving  behavior.  Among  these  vere 
two  laboratory  driving  simulators  and  several  psychomotor  abilities  measures 
including  simple  reaction  time,  Jump  reaction  time,  motor  Judgment  and  complex 
coordination  test. 

The  two  driving  simulators  employed  were  the  Allstate  Good  Driver  Trainer 
(AGDT)  and  the  Aetna  Drivotron.  Both  simulators  consisted  of  a  cab  in  which 
the  driver  sits,  a  motion  picture  screen  and  projector,  and  scoring  consoles. 
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Table  1:  Biographical  Data  -  Taxicab  Drivers 


Variable 

Group 

X 

s 

N 

t 

Age 

Total 

44.61 

12.55 

300 

White 

50.07 

11.86 

150 

1 

Negro 

39.15 

10.73 

150 

6.33** 

Education 

Total 

11.14 

2.95 

299 

(years) 

White 

10.70 

2.61 

149 

Negro 

11.57 

3.03 

150 

2.57* 

Driving 

Total 

26.43 

12.60 

303 

Experience 

(years) 

White 

31.41 

12.46 

152 

Negro 

21.42 

10.62 

151 

7.48** 

Annual 

Total 

36.64 

15.32 

272 

Driving 

Mileage 

White 

37-34 

13-23 

145 

(in  000’s) 

Negro 

i  40.13 

17.33 

127 

1.50 

*  P  <  -05 


**  p  <  .01 
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The  Motion  picture  presented  various  driving  situations  and  the  driver  reacted 
by  steering,  braking;  signaling;  etc.,  in  response  to  the  situations  presented. 
The  total  nuatoer  of  incorrect  responses  vas  used  as  the  only  predictor  measure 
with  the  AGDT.  When  the  Drivotron  vas  used,  several  Measures  including;  steer¬ 
ing  errors,  speed  errors,  brake  errors,  signal  errors,  and  number  of  starbursts 
(masber  of  accidents  which  would  have  resulted  from  errors  comitted)  were  com- 
bined  to  provide  total  error  scores.  Each  simulator  drive  ranged  from  20-25 
minutes.  Half  of  the  sample  began  with  the  Drivotron,  followed  by  the  AGDT; 
the  other  half  was  given  the  AGDT  followed  by  the  Drivotron. 

In  addition  to  the  driving  simulator,  the  following  measures  were  investi¬ 
gated  as  potential  predictors: 

a)  Simple  reaction  time  vas  measured  by  the  time  required  for  the  S  to 
depress  a  response  button  after  the  onset  of  a  stimulus  light.  The  S's  hand 
rested  on  the  response  button  at  the  beginning  of  each  trial.  A  total  score 
comprised  of  the  sum  of  reaction  times  over  20  trials  vas  used. 

b)  Jump  reaction  time  was  measured  similarly  with  the  additional  require¬ 
ment  that  the  8  place  his  hand  12  inches  away  from  the  response  button  until 
the  stimulus  light  vas  activated. 

c)  The  Motor  Judgment  Test  measured  the  ability  to  stake  continuous  antici¬ 
patory  motor  adjustments  relative  to  changes  In  speed  and  direction  of  a  continu¬ 
ously  moving  target.  In  the  Motor  Judgment  Test,  the  S  vas  confronted  by  two 
adjacent  disks  rotating  at  a  constant  speed.  Each  disk  bad  black  and  white 
sections  on  its  perimeter.  Between  these  disks  vas  a  pointer  whose  speed  of 
rotation  the  S  could  control.  The  S  vas  not  able  to  stop  the  rotation  of  this 
pointer  completely  nor  could  he  exert  control  over  the  two  rotating  disks.  The 

3  vas  required  to  make  as  many  revolutions  of  the  pointer  as  possible  without 
crossing  the  black  areas  on  the  rotating  disks.  To  do  this  properly  S  had  to 
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integrate  his  estimates  of  the  speed  of  each  disk,  the  pointer  and  his  o>m 
control  movements.  Scores  recorded  were  number  of  pointer  revolutions  and 
number  of  errors  (crossing  of  black  areas)  during  four  1 -minute  trials 
separated  by  15-second  rests. 

d)  The  Complex  Coordination  Test  is  a  measure  of  multilimb  coordination 
which  represented  the  ability  of  a  S  to  make  coordinated,  simultaneous  responses 
of  several  limbs  in  operating  controls.  Patterns  of  lights  were  presented  whose 
positions  were  to  be  matched  by  appropriate  adjustments  of  stick  and  rudder  con¬ 
trols.  A  correct  response  was  accomplished  only  when  both  the  hands  and  feet 
had  completed  and  maintained  the  appropriate  adjustments,  at  which  point  a  new 
pattern  of  lights  to  be  matched  were  presented.  The  scores  are  the  number  of 
completed  matchings  during  a  four  and  a  ten  minute  period. 

Table  2  presents  the  predictor  means,  standard  deviations  and  tests  of 
significance  of  mean  differences  for  the  taxicab  driver  sample.  The  white  sub¬ 
group  committed  significantly  more  errors  on  the  AGET  than  the  Negro  subgroup. 

On  the  Motor  Judgment  Test  the  white  drivers  had  more  errors  but  completed  more 
total  pointer  revolutions  than  the  Negro  sample.  There  were  no  other  signifi¬ 
cant  predictor  mean  differences  between  the  white  and  Negro  subgroups. 

Criteria 

Three  measures  of  driving  performance  were  used  as  criterion  measures. 

Street  observations  of  actual  driving  behavior  were  made  by  two  independent  ob¬ 
servers.  The  drivers  were  not  informed  that  they  were  being  observed.  This  was 
accomplished  by  having  the  observers  ride  in  each  cab  from  dovntown  Washington 
to  Silver  Spring,  Maryland  as  if  they  were  normal  passengers.  When  the  vehicle 
arrived  at  the  AIR  office  in  Silver  Spring,  the  drivers  vere  informed  of  the  re¬ 
search  project.  However,  they  were  not  told  that  their  driving  behavior  had 
been  systematically  observed.  During  the  trip,  each  observer  Independently 
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Table  2:  Predictor  Means,  Standard  Deviations,  H's  and  Tests 
of  Significance  of  Mean  Differences  -  Taxicab  Drivers 


Predictor 

Group 

X 

s 

N 

t 

AGDT  - 

Total 

45.  44 

10.65 

300 

Total 

Incorrect 

White 

47. 07 

9-75 

150 

Negro 

43.81 

11.27 

150 

2.67** 

Drivotron  - 

Total 

42.95 

5.80 

303 

Total 

White 

42.78 

5.87 

152 

Negro 

43.ll 

5-74 

151 

0.49 

Drivotron  - 

Total 

4.93 

1.60 

303 

Starbursts 

White 

4.90 

1.59 

152 

Negro 

4-95 

1.61 

151 

0.27 

Drivotron  - 

Total 

1 

9.93 

1.96 

303 

Steering 

White 

9.93 

1.89 

152 

Negro 

9.94 

2.04 

151 

0.04 

Drivotron  - 

Total 

13*34 

3.26 

303 

Speed 

White 

13*61 

3.19 

152 

Negro 

13.08 

3.33 

151 

1.41 

Drivotron  - 

Total 

10.41 

2.77 

303 

Brake 

White 

IO.36 

2.63 

152 

Negro 

10. 

2.92 

151 

0.31 

Drivotron  - 

Total 

1.24 

0.95 

303 

Signal 

White 

1.24 

0.95 

152 

Negro 

1.24 

0.96 

151 

0.00 

Tk*le  2  (Contd) 


Predictor 

Group 

X 

8 

K 

t 

Simple  Reaction 

Total 

232.75 

53-99 

299 

Time 

White 

236.02 

42.56 

151 

Kegro 

229.41 

63.55 

148 

1.06 

Jwp  Reaction 

Total 

540.4a 

95.06 

300 

flaw 

White 

541.29 

91.81 

152 

Kegro 

539-48 

96.60 

148 

1.60 

Motor  Judgment 

Total 

43.83 

27.81 

301 

Total  Errors 

White 

47.03 

33-46 

150 

Kegro 

40.66 

20.34 

151 

1.99* 

Motor  Judgment 

Total 

53.05 

19.49 

301 

Total 

Revolutions 

White 

56.98 

23.14 

150 

Kegro 

49.14 

14.03 

151 

3.55** 

Complex  Coord. 

Total 

6.96 

5.62 

299 

4  min. 

White 

9.40 

6.14 

151 

Kegro 

8.52 

5.01 

148 

1.35 

Complex  Coord. 

Total 

25.71 

14.92 

299 

10  min. 

White 

26.74 

16.14 

151 

Kegro 

24.66 

13.55 

148 

1.20 

*  P  <  .05 


**  p  <  .01 
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recorded  errors  Bade  by  the  driver,  using  a  partially  pre-coded  check  list 
which  had  been  developed  on  the  basis  of  previous  research  (Sdvards  and  Hahn, 
1964).  The  fom  yielded  a  total  error  score  which  was  used  in  the  present 
analysis. 

The  total  nunber  of  moving  violations  with  which  each  driver  had  been 
charged  and  the  total  nunber  of  accidents  in  which  each  driver  had  been  in¬ 
volved  were  also  used  as  criterion  measure s.  These  data  were  obtained  from 
the  official  records  of  the  Department  of  Motor  Vehicles. 

As  Indicated  in  Table  3,  the  Negro  drivers  haul  been  Involved  in  more  acci¬ 
dents  and  had  been  charged  with  more  moving  violations  than  the  white  subgroup. 
There  was  no  significant  difference  between  the  two  subgroups  on  the  number  of 
street  errors  recorded  by  the  two  observers. 

Validity 

Predictor-criterion  correlations  are  presented  in  Table  4.  Generally,  low 
validity  was  found  between  the  various  predictors  and  criteria.  The  validity 
coefficients  in  Table  4  were  corrected  for  age  by  partial  correlation  techniques, 
lb  predictor  was  valid  for  more  than  a  single  criterion.  The  observed  Btreet 
error  criterion  was  slightly  more  predictable  than  recorded  accidents  on  moving 
violations. 

Models  Illustrated 

An  examination  of  these  data  led  to  several  cases  vhlcb  illustrated  the 
Bartlett  and  O'Leary  models  (1969)*  They  are: 

a)  Two  examples  of  Kodel  b  which  represents  the  situation  in  which 
there  are  no  mean  differences  on  either  predictor  or  criterion 
measures  and  significant  validity  for  only  the  white  subgroup. 
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Table  3:  Criterion  Means,  Standard  deviations,  H'  a,  and  Tests 
of  Significance  of  Mean  Differences  -  Taxicab  Drivers 


Criterion 

1 

Group 

X  1 

s 

R 

t 

Total  Street 

Total 

22.57 

8.36 

303 

Errors 

White 

23-  34 

9.09 

152 

Regro 

21.61 

7.50 

151 

1.59 

Accidents 

Total 

1.1*2 

1.14 

303 

White 

1.23 

0.95 

152 

Regro 

1.60 

1.28 

151 

2.85** 

Violations 

Total 

3.11 

2.76 

303 

White 

2.48 

2.62 

152 

Regro 

3-74 

2.76 

151 

4.06** 

**  p  <  .01 
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Table  4:  Predictor  -  Criterion  Correlation*  -  Total  Group, 


White  and  Negro  Subgroups 
Taxicab  Drivers^  ^ 


Criterion 

Predictor 

■ 

Group 

Street  Errors 

1  Accidents 

Violations 

' 

r 

i 

N 

r 

* 

r 

N 

AGDT  - 

Total 

-04 

300 

-17** 

300 

00 

300 

Total 

Incorrect 

White 

-07 

150 

-21** 

150 

01 

150 

Negro 

-04 

150 

-12 

150 

07 

150 

18) 

i 

Drivotron  - 

Total 

00 

303 

06 

303 

-03 

303 

Total 

White 

03 

15 2 

01 

152 

-01 

152 

Negro 

-03 

151 

09 

151 

-06 

151 

Drivotron  - 

Total 

rH 

O 

1 

303 

-08 

303 

04 

303 

Starbursts 

White 

-07 

152 

-09 

152 

08 

152 

Negro 

08 

151 

-09 

151 

00 

151 

Drivotron  - 

Total 

01 

303 

f 

O 

v/> 

303 

-02 

303 

Steering 

White 

-04 

152 

07 

152 

00 

152 

Negro 

06 

151 

-16* 

151 

-04 

151 

(7) 

Drivotron  - 

Total 

04 

303 

-03 

303 

1 

O 

UJ 

303 

Speed 

White 

-01 

152 

-04 

152 

02 

152 

Negro 

08 

151 

00 

151 

1 

O 

KjJ 

151 

Drivotron  - 

Total 

-04 

303 

-08 

303 

02 

303 

Brake 

White 

-01 

152 

-05 

152 

03 

152 

Negro 

-06 

151 

-11 

151 

00 

151 

ignal 


Total 

White 

Negro 


Simple  Reaction |  Total 
White 
Negro 


Jump  Reaction  Total 
Ime 

White 


Negro 


Total  Errors 


Motor  Judgment 
Total 

Revolutions 


Complex  Coord 


Complex  Coord 
10  min. 


Table  4  (Contd) 


Street  Errors 


Accidents 


Violations 


303 

09 

303 

-02 

303 

152 

09 

152 

-04 

152 

151 

09 

151 

00 

151 

06  300 

01  152 


Total 

-02 

301 

-10 

301 

07 

301 

White 

-06 

150 

-08 

150 

16* 

150 

Negro 

02 

151 

-10 

151 

01 

151 

(8) 

Total 

03 

301 

-10 

301 

05 

301 

White 

01 

150 

-07 

150 

12 

150 

Negro 

02 

151 

-08 

151 

07 

151 

Total 

16** 

299 

-12 

299 

-07 

299 

White 

15 

151 

-05 

151 

-02 

151 

Negro 

15 

148 

-13 

148 

-10 

148 

Total 

17** 

299 

-13 

299 

-06 

299 

White 

18* 

151 

-08 

151 

-01 

151 

Negro 

13 

(5) 

148 

-12 

148 

-08 

148 

(l)  The  rasher  in  parentheses  beneath  the  Negro  group  correlation 

indicates  the  model  illustrated  (after  Bartlett  and  O'Leary,  1969) 


b)  A  single  example  of  Model  7  was  observed  In  the  taxicab  driver 
data.  Model  7  illustrates  the  situation  in  which  there  is  a  mean 
criterion  difference  and  validity  for  only  the  Begro  subgroup. 

c)  Two  examples  of  Model  8  were  illustrated  by  the  predictor-criterion 
relationships.  This  model  is  representative  of  the  situation  in 
which  there  are  significant  subgroup  mean  differences  in  both  pre¬ 
dictor  and  criterion  performance  but  significant  validity  for  only 
one  subgroup.  It  is  interesting  to  note  that  only  a  single  pre¬ 
dictor  variable  exhibited  significant  validity  with  the  Begro  sub¬ 
group  despite  the  fact  that  the  two  subgroups  were  of  approximately 
equal  size. 

Multiple  Regression  Analyses 

A  stepwise  regression  procedure  vas  used  to  select  predictors  from  the 
total  13  available  for  the  multiple  regression  analyses  using  the  criterion 
developed  by  Lord  (1950).  The  results  of  these  analyses  for  the  total  group 
and  the  two  racial  subgroups  are  shown  in  Table  J>.  For  the  total  group  the 
multiple  regression  equation  was  developed  on  a  random  back  sample  of  200  cases 
and  cross  validated  on  a  sample  of  9h  cases.  For  the  racial  subgroups  the 
multiple  regression  equations  vere  developed  on  back  samples  of  100  random  cases 
each  and  cross  validated  on  a  sample  of  47  cases  each.  Bone  of  the  relation¬ 
ships  found  in  the  back  samples  held  up  in  the  cross  validation  groups.  In 
general,  different  predictors  tended  to  be  chosen  for  the  multiple  regression 
equations  in  the  two  racial  subgroups. 

Table  6  presents  the  results  of  multiple  regression  analyses  conducted 
using  the  total  sample  (H  =  29*0  as  the  back  sample  on  which  the  equations  were 
developed  and  the  racial  suhgroups  as  cross  validation  samples  (B  *  147  for 
both  subgroups).  As  can  be  seen  In  Table  b,  the  multiple  correlations  held 
up  in  three  of  the  six  instances.  The  success  of  these  regression  equations 
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Table  5:  Multiple  Regression  Analyses  -  Equations, 
Back  Semple  R's  and  Cross-Validated  R's 


Group 

Criterion 

Eauation 

Total 

Street  Errors 

T  =  2A.55  -.166  X2  +.216  X5  +  .076  Xia 
rBACK  *  -21  (*"200);  Rc.v  -  .06  t»-9A) 

Accidents 

y  =  2.676  -  .016  xi  -  .072  x3  -  .009  xl3 

RBACK  “  *23  (*-200);  Re_T  ■  .19  (**9*) 

Violations 

I  =  I.89A  +  .006  Xfl  -  .215  Xjg  +  .070  X23 

RBACK  *  («00)i  Rc-v  "  *02  (E-94) 

White 

Street  Errors 

Accidents 

y  =  29.806  -  .227  Xg  -  1.235  Xig  +  .535  x„ 
rBACK  -  -28  (WOO);  Rc.v  =  .05  (H-A7r 

y  =  1.723  -  .024  xi  +  .060  x* 

rBACK  a  .27.  (WOO;  Rc.v  *  .11  (N=47) 

Violations 

y  «  -.OOA  -  .206  X4  +  .127  X5  ♦  .008  Xg  +  .017  X10 
rBACK  -  -33  (B-1Q0);  Rc.;  «  .16  (1-A7) 

Negro 

Street  Errors 

Accidents 

y  =  11.438  +  .3A0  X5  -  .338  X$  +  .028  X7  +  .3A6  Xjg 
rBACK  *  *38  (R-lOO);  Rc.y  »  .02  (*=47) 

y  =  2.892  -  .094  X|*  -  .046  Xj2 

rBACK  *  -26  (If- 100);  Rc_v  »  .01  (*-47) 

Violations 

y  =  0.476  -  .299  x7  +  .004  Xo  -  .066  x,c  +  .086  X„ 
rback  =  -30  (iwoo);  Rc-v  *  *°5  (*»T) 

Scrte :  X^  =  AGDT,  Total  Incorrect;  Xg  =  Drivotron,  Total; 

X3  *  Drivotron,  Starbursts;  X^  =  Drivotron,  Steering; 

X5  =  Drivotron,  Speed;  X&  =  Drivotron.  Brake; 

Xj  *  Drivotron,  Signal;  Xg  *  Simple  Reaction  Time; 

X9  *  Juap  Reaction  Time;  X.0  *  Motor  Judgment,  Total  Errors; 

Xj2  *  Motor  Judgment,  Total  Revolutions;  Xjg  “  Complex  Coordination, 
A  min. ;  X^  =  Complex  Coordination,  10  min. 


IT 


Table  6;  Multiple  Regression  Analyses  Based  upon 
Total  Saaple  (11*294)  and  Cross  Validated  on 
Racial  Subgroups  ( IN  147  for  both  White  and  ICegro  Subgroups) 


R 

R 

R 

Criterion 

Total  Saaple  Equation  1  Total  Saaple 

White  C-V 

negro  C-V 

Street  Errors 

r  =  21.297  -  .637  X- 
+  .083  x13  ' 

.18 

.18* 

-.13 

Accidents 

Y  *  2.739  -  *017  Xx 

-  .006  Xu  -  .008  X13 

•23 

.22* 

.18* 

Violations 

Y  a  3.447  -  .038  Xu 

.08 

rH 

O 

a 

1 

.05 

*  P  <  -05 

Mote:  X^ig  refer  to  saute  variables  as  described  in  Table  5. 
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as  compared  to  those  developed  in  the  subgroups  separately  is  probably  doe  to 
two  factors,  vis.,  the  greater  stability  in  the  beta  weights  due  to  Increased 
back  sample  site  and  the  fact  that  the  cross  validation  samples  were  included 
as  part  of  the  back  sample. 
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Study  #2:  North  Carolina  Students: 

The  Prediction  of  Success  for  Ninth  and  Twelfth  Grade  Students 


Sample 

Academic  data  from  a  stratified  random  sample  of  48  classes  in  the 
North  Carolina  public  school  system  were  obtained  from  the  Institute  for 
Behavioral  Research  in  Creativity.1  A  large  number  of  these  classes  were 
not  racially  Integrated.  Thus ,  rather  than  analyze  the  data  for  the  total 
group  vhich  would  include  segregated  classes,  the  authors  have  chosen  to 
present  data  on  two  racially  integrated  classes  -  a  ninth  grade  class  and 
a  twelfth  grade  class.2  The  ninth  grade  class  consisted  of  221  students, 

166  white  and  55  Negro;  while  the  twelfth  grade  class  was  composed  of  245 
whites  and  58  Negroes. 

Predictors 

Test  scores  on  the  California  Test  of  Mental  Maturity,  a  measure  of 
general  mental  ability,  were  obtained  for  both  the  ninth  and  twelfth  grade 
samples.  Also  Included  as  predictors  were  four  keys  of  the  Alpha  Biographical 
Inventory  developed  by  the  Institute  for  Behavioral  Research  in  Creativity. 
This  Inventory  consists  of  300  items  about  childhood  activities,  experiences, 
sources  of  derived  satisfaction  and  dissatisfaction,  academic  experiences, 
attitudes  and  Interests,  value  preferences,  and  self -description  evaluations. 
The  four  keys  were  Rank  in  Class,  Grade  Point  Average,  Leadership,  and  Creat¬ 
ivity.  The  Rank  in  Class  key  was  not  available  for  the  ninth  grade  sample. 


(1)  The  authors  would  like  to  thank  Robert  Lacklen  of  the  Richardson  Founda¬ 
tion  and  Robert  Ellison  of  the  Institute  for  Behavioral  Research  in 
Creativity  for  providing  these  data. 

(2)  An  analysis  of  the  total  sample  has  recently  been  reported  by  Ellison, 
James,  Fox,  and  Taylor  (1970). 
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The  total  Maple  was  divided  into  approximately  equal  groups  based  on  an  odd- 
even  split.  The  odd  cases  were  used  to  develop  the  Rank  In  Class  Key  while 
the  even  cases  were  used  as  a  cross-validation  sample.  The  Grade  Point  Average 
Key,  the  Leader  ah'*  p  Key,  and  the  Creativity  Key  had  been  previously  developed 
and  thus  the  validity  data  presented  are  examples  of  validity  generalization. 

Table  7  presents  the  means,  standard  deviations,  and  tests  of  signifi¬ 
cance  between  means  for  the  two  racial  groups  in  the  ninth  grade  sample. 

White  students  scored  higher  than  Regro  students  on  the  California  Test  of 
Mental  Maturity.  There  was  no  difference  between  the  mean  performance  of  the 
two  racial  groups  on  any  of  the  keys  developed  on  the  Alpha  Biographical  In¬ 
ventory. 

Mean  scores  for  the  two  racial  groups  in  the  twelfth  grade  sample  are 
presented  in  Table  8.  As  in  the  ninth  grade  sample,  white  students  scored 
higher  than  Regro  students  on  the  California  Test  of  Mental  Maturity.  More¬ 
over,  white  students  scored  higher  than  Regro  students  on  the  Grade  Point 
Average  Key,  the  Leadership  Key,  and  the  Creativity  Key  of  the  Alpha  Biogra¬ 
phical  Inventory.  There  was  no  difference  between  the  mean  scores  of  the  two 
groups  on  the  Rank  in  Class  Key. 

Criteria 

The  following  different  criterion  measures  were  obtained:  Classroom  grades 
in  English,  math,  science,  social  studies,  and  a  total  Grade  Point  Average. 

Also  for  the  twelfth  grade  sample  Rank  in  Class  (position  divided  by  class 
size)  was  computed. 

Two  teacher  ratings  vere  obtained  on  all  students.  The  first,  a  Leader¬ 
ship  rating,  was  an  average  of  the  student's  ratings  on  the  following  traits: 
Participation  in  class  discussion.  Initiative,  Dominance,  Initiative  of  dis¬ 
ruptive  activities.  Acceptance  by  others.  Consideration,  Interest  in  getting 
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Table  7:  Predictors:  Means,  Standard  Deviations  and  M's 
Ninth  Grade  Students 


Predictor 

Group 

X 

s 

N 

t 

California  Test 

Total 

103.60 

15.47 

202 

of  Mental 
Maturity 

White 

106.32 

15.28 

157 

Negro 

94.11 

12.15 

45 

4.91** 

Alpha  Biographical 
Inventory 


Average  GPA 

Key 

Total 

White 

Negro 

102. 51 

103.44 

1 

99-71 

21.86 

22.22 

20.68 

221 

166 

55 

1.09 

leadership  Key 

Total 

102.03 

22.76 

221 

White 

102.69 

23.46 

166 

Negro 

100.05 

20.56 

55 

.74 

Creativity  Key 

Total 

102.28 

24.65 

221 

White 

102.98 

25.36 

166 

Negro 

100.18 

22.47 

55 

.73 

**  p  <  .01 
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California  Test 
of  Mental 
Maturity 


Total 

White 

Negro 


105.11 

107.71 

90.33 


14.15 

13.22 

9.46 


Rank  in  Class  Key  |  Total 


Average  GPA  Key  |  Total 


Leadership  Key  | Total 


Creativity  Key  | Total 


Total 

99.44 

20.31 

303 

White 

90.73 

20.97 

245 

Negro 

102.47 

17.32 

58 

Total 

102.19 

18.75 

303 

White 

103.55 

19.22 

245 

Negro 

96.48 

15.51 

58 

Total 

104.17 

20.42 

303 

White 

105.61 

20.73 

245 

Negro 

96.10 

17.97 

58 

Total 

104.30 

22.06 

303 

White 

105.85 

22.41 

245 

Negro 

97.74 

19.36 

58 

an  education,  Involvement  in  personal  academic  activities.  The  second,  a 
creativity  rating,  was  an  average  of  the  students  ratings  on  the  following 
traits:  Generalisation  of  new  ideas,  Jeeks  new  solutions.  Independence, 
Defends  own  ideas,  Quality  and  quantity  of  questions.  Originality,  Enjoy¬ 
ment  of  complex  problems.  Participation  in  open-ended  discussions.  Interest 
in  outside  actlvites. 

Mean  criterion  data  for  the  ninth  grade  sample  is  presented  in  Table  9* 
White  students  obtained  significantly  higher  grades  than  Negro  students  in 
English,  oath,  and  social  studies  as  well  as  the  overall  Grade  Point  Average. 
There  was  no  difference  between  the  mean  grades  of  the  two  racial  groups  In 
science.  White  students  also  received  higher  teacher  ratings  on  Leadership 
and  Creativity  than  Negro  students. 

Table  10  presents  the  mean  criterion  data  for  the  twelfth  grade  sample. 
White  students  obtained  higher  grades  and  teacher  ratings  than  Negro  students 
on  all  the  criterion  measures . 

Validity 

Correlations  between  the  criterion  and  predictor  measures  for  the  ninth 
grade  sample  are  presented  in  Table  11.  All  predictors  correlated  signific¬ 
antly  with  the  criterion  measures  for  both  racial  groups.  The  relationship 
between  the  California  Test  of  Mental  Maturity  and  English  grades  was  the 
only  instance  where  a  difference  in  the  validity  occurred  between  the  racial 
groups.  The  Negro  sample  validity-coefficient  was  significantly  greater 
than  the  white  sample  coefficient.  However,  botn  coefficients  were  signifl~ 
cantly  related  to  the  criterion. 

Correlations  between  the  predictors  and  criterion  for  the  twelfth  grade 
sample  are  presented  in  Table  12.  All  of  the  predictors  correlated  signifi- 
cant  with  the  Rank  in  Class  criterion  and  teacher  ratings  of  Creativity  for 
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Table  9:  Criteria:  Meane,  Standard  Deviations  and  I's 
ninth  Grade  flanple 


Criterion 

Group 

X 

s  1  N 

t 

Grades 

English 

Total 

23.56 

9.77 

219 

White 

25.37 

9.b9 

16b 

Jfegro 

18.18 

8.63 

55 

b.95** 

Math 

Total 

21.82 

10.6b 

219 

White 

23.05 

10. b7 

16b 

Negro 

18.18 

IO.38 

55 

2.98** 

Science 

Total 

22.2k 

11.77 

219 

White 

23. 11 

12.16 

16b 

Negro 

19.6b 

10.18 

55 

1.90 

Social 

Total 

23.06 

9*87 

219 

Studies 

White 

2b. 27 

9.66 

16b 

Negro 

19.^5 

9.70 

55 

3.18** 

Average  Grade 

Total 

22.89 

9.08 

219 

Point 

White 

2b.  17 

8.9I 

16b 

Negro 

19.07 

8.60 

55 

3.69** 

Teacher  Ratings 


Leadership 

Total 

29.73 

7.26 

219 

White 

30.92 

7.02 

16b 

Negro 

26.13 

6.8b 

55 

Creativity 

Total 

25.18 

9.59 

219 

White 

26.62 

9.bb 

16b 

Negro 

20.82 

8.76 

55 

b.00** 

**  p  <  .01 


Table  10:  Criteria:  Means,  Standard  Deviations  and  M's 
Twelfth  Grade  Saaple 


Criterion 

Group 

X 

8 

M 

t 

Rank  in  Class 

Total 

44.35 

28.41 

299 

White 

41.26 

27.87 

244 

Megro 

58.07 

26.89 

55 

4.05** 

Grades 

Ei^lish 

Total 

19.93 

7.94 

299 

White 

20.99 

7.76 

243 

Megro 

15-36 

7.13 

56 

4.95** 

Math 

Total 

20.61 

11.26 

244 

White 

21.33 

11.37 

210 

Megro 

16.18 

9.54 

34 

2.49* 

Science 

Total 

17.84 

10.25 

White 

19.15 

10.21 

Megro 

12.35 

8.55 

Kfl 

3.57** 

Social 

Total 

21.13 

10.16 

293 

Studies 

White 

22.01 

10.42 

239 

Megro 

17.22 

7.87 

54 

3-76* 

Average  Grade 

Total 

19.87 

8.29 

300 

Point 

White 

20.91 

8.32 

244 

Megro 

15.34 

6.52 

56 

5.42** 

Teacher  Ratings 


Leadership 

Total 

White 

1  *«««> 

29-59 

30.  uo 

26.1b 

6.87 

6.83 

6.00 

303 

245 

58 

4.33** 

Creativity 

Total 

27.94 

8.44 

303 

White 

29.06 

8.34 

245 

Megro 

23.21 

7.17 

58 

4.91** 

» 

*♦ 


P  < 
P  < 
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Table  11:  Predictor  **  Criterion  Correlations 
Total  Group,  Whites  and  Negroes 
Ninth  Grade  Sample^*  2) 


Predictor 

Criterion 

Alpha  Biographical  Key 

1  CRM 

1  G.P.A.  )  Leadership  1  Creativity  1 

Grades 

N 

\ 

N  j 

Kngllsh 

; 

Total 

48** 

201 

55** 

46** 

47** 

219 

White 

38** 

156 

53** 

43** 

45** 

164 

Negro 

65**S 

45 

64** 

61** 

61** 

55  j 

(1) 

(3) 

(3) 

(3) 

J 

Nath 

Total 

55** 

201 

55** 

50** 

-50** 

219 

White 

52** 

156 

53** 

48** 

49** 

164 

Negro 

54** 

45 

64** 

51** 

47** 

55 

(1) 

(3) 

(3) 

(3) 

Science 

Total 

53** 

201 

68** 

62** 

.64** 

219 

White 

52** 

156 

70** 

63** 

.64** 

164 

Negro 

47** 

*5 

61** 

58** 

60** 

55 

(2) 

Social 

Total 

42** 

201 

61** 

55** 

56** 

219 

Studies 

White 

<4** 

156 

59** 

52** 

54** 

164 

Negro 

5b** 

*5 

69** 

65** 

66** 

55 

(1) 

(3) 

(3) 

(3) 

Average  Grade 

Total 

58** 

201 

70** 

62** 

63** 

219  1 

Point 

White 

52** 

156 

70** 

62** 

63** 

164 

Negro 

6 2** 

45 

73** 

69** 

.69** 

55  i 

(1) 

(3) 

(3) 

j 

Teacher  Ratings 


Leadership 

Total 

White 

Negro 

58** 

54** 

52** 

(1) 

2  02 
157 
45 

72** 

70** 

82** 

(3) 

71** 

71** 

79** 

(3) 

71** 

70** 

79** 

(3) 

221 

166 

55 

Creativity 

Total 

64** 

202 

64** 

64** 

64** 

221 

White 

62** 

157 

62** 

64** 

64** 

166 

Negro 

53** 

(1) 

45 

71** 

(3) 

69** 

(3) 

70** 

(3) 

55 

(1)  Dec laais  are  onitted. 

(2)  Number  in  parentheses  below  the  correlation  for  the  Negro  sample  indicates 
the  model  illustrated. 

*  P  -05 

**  p  .01 

#  Indicates  those  aodels  in  which  a  significant  difference  exists  between 
the  validity  coefficients  for  the  two  ethnic  groups. 
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Table  12:  Pradietor  -  Criterion  Correlations 


Total  Group,  Whites  aid  Negroes 


Twelfth  Grade  taple(l>  2) 


Predictor 

Criterion 

Alpha  Biographical  Inventory 

CUM 

n 

!  Rank  in  Class 

G.P.A. 

V 

Rank  In  Class 

Total 

-  54** 

238 

67** 

-  69** 

299 

White 

-  53** 

20k 

71** 

-  72** 

24k 

Negro 

-  55** 

34 

5o**e 

-  47*«8 

55 

(1) 

(3) 

(1) 

Grades 


English 

Total 

50** 

237 

-  54** 

57** 

299 

White 

50** 

203 

-  61** 

-62** 

243 

Negro 

158 

34 

-  ITS 

188 

56 

(8) 

(6) 

(6) 

Ifath 

2d 

198 

-  43** 

45** 

■FT1 'Hi 

White 

26 

178 

-  46** 

47** 

210 

Negro 

38 

20 

-  126 
(6) 

(8* 

34 

Science 

Total 

U2** 

138 

-  59** 

59** 

176 

White 

39** 

115 

-  63** 

6e** 

142 

Negro 

27 

23 

-  39* 

36* 

34 

(8) 

(3) 

(1) 

Social 

Total 

msa 

233 

-  61** 

62** 

293 

Studies 

White 

201 

-  66** 

fCm  a 
UO"" 

239 

Negro 

32 

_ 

-  229 

228 

54 

Bui 

(6) 

(8) 

Average  Grade 

Total 

51** 

238 

-  63** 

65 

300 

Point 

White 

50** 

204 

-  69** 

69** 

244 

Negro 

34** 

34 

-  28«e 

27*8 

56 

_ 

(1) 

(3) 

(1) 

Predictor 

Alpha  Biographical  Inventory 

Criterion 

Leadership 

Creativity 

N 

Rank  in  Class 

Total 

-  58** 

-  59** 

299 

White 

-  60** 

-  61** 

244 

Negro 

-  W 

Mt 

-  1*2** 

55 

(1) 

(1) 

2  8 


Table  12  (Contd) 


Criterion 

Predictor 

Alpha  Biographical  Inventory 

)  leadership  1  Creativity  1  If 

Grades 

English 

Total 

White 

Ifegro 

50** 

55** 

15® 

(8) 

52** 

57** 

158 

(8) 

299 

*3 

56 

Math 

Total 

3k** 

3k** 

2kk 

White 

36** 

36** 

210 

Ifegro 

07® 

(8) 

078 

(8) 

3k 

Science 

Total 

5k** 

55** 

176 

White 

55** 

56** 

lk2 

llegro 

36* 

(1) 

3k* 

(1) 

3k 

Social 

Total 

56** 

56** 

293 

Studies 

White 

60** 

61** 

239 

Ifegro 

19@ 

(8) 

178 

(8) 

5k 

Average  Grade 

Total 

56** 

57** 

300 

Point 

White 

60** 

61** 

2kk 

Ifegro 

2k® 

(8) 

238 

(8) 

56 

rt-edlctor 


Criterion 


Alpha  Biographical  Inventory 


I  CTMM  I  If  I  Rank  in  Class  I  G.P.A. 


Teacher  Ratings 


Leadership 

Total 

White 

Ifegro 

33** 

29** 

29 

(8) 

2kl 

205 

36 

-  50** 

-  52** 

-  39** 

(8) 

50** 

51** 

36** 

(1) 

303 

2k5 

58 

Creativity 

Total 

i»2** 

2kl 

-  50** 

50** 

303 

White 

36** 

205 

-  52** 

51** 

2k5 

Ifegro 

kk** 

(1) 

36 

-  37** 

(3) 

3k** 

(1) 

58 

29 


I 


I 

1 


Table  12  (Contd) 


Criterion 

Predictor 

Alpha  Biographical  Inventory 

Leadership 

Creativity 

N 

Teacher  Ratings 

leadership 

Total 

55** 

55** 

303 

White 

55** 

55** 

245 

Negro 

44** 

1*3** 

58 

(1) 

(1) 

Creativity 

Total 

55** 

55** 

303 

White 

56** 

56** 

245 

Negro 

1*2** 

1*1** 

58 

(1) 

(1) 

(1)  Decimals  are  omitted. 

(2)  lumber  in  parentheses  below  the  correlation  for  the  Negro  sample 
Indicates  the  model  Illustrated. 

*  P  <  .05 

**  p  <  .01 

8  Indicates  those  models  In  vhich  a  significant  difference  exists  between 
the  validity  coefficients  for  the  two  ethnic  groups. 
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both  racial  groups.  In  two  cases,  the  relationship  between  Rank  in  Class 
and  the  Rank  in  Class  key  and  the  Grade  Point  Average  key,  the  validity 
coefficients  for  the  white  sample  were  significantly  higher  than  those  for 
the  Megro  sample.  English  and  math  grades  were  predictable  for  the  white 
sample  but  not  for  the  Megro  sample. 

The  California  Test  of  Mental  Maturity  predicted  social  studies  grades 
and  Grade  Point  Average  for  both  racial  groups.  However,  social  studies 
grades  and  Grade  Point  Average  were  predictable  only  for  the  white  sample 
using  the  Leadership  and  Creativity  keys  of  the  biographical  inventory.  The 
Rank  in  Class  key  and  the  Grade  Point  Average  key  predicted  Grade  Point  Average 
for  both  racial  groups;  however,  the  white  sample  validity  coefficients  were 
significantly  greater  than  those  for  the  Megro  sample. 

Science  grades  were  predictable  for  the  white  sample  but  not  for  the 
Megro  sample  using  the  California  Test  of  Mental  Maturity  as  the  predictor. 

When  the  biographical  keys  are  used  as  predictors,  the  science  grades  are 
equally  predictable  for  the  two  racial  groups. 

Teacher  ratings  of  Leadership  were  predictable  for  both  racial  groups 
using  the  biographical  keys  but  were  only  predictable  for  the  white  sample 
when  the  California  Test  of  Mental  Maturity  is  used  as  the  predictor. 

Models  Illustrated  -  Minth  Grade 

a)  An  analysis  of  the  predictor- criterion  relationships  for  the  ninth 
grade  sample  Indicates  that  eighteen  cases  of  Model  3  were  Illustrated.  (The 
number  in  parentheses  below  the  correlation  for  the  Megro  sample  in  Table  11 
Indicates  the  model  Illustrated).  In  all  Illustrations  of  this  model  white 
students  scored  higher  than  Negro  students  on  the  criterion  measures  but  there 
was  no  difference  between  the  two  groups  on  the  predictors.  The  test  was  valid 
for  both  racial  groups.  In  every  illustration  of  this  model,  the  use  of  a 
single  regression  line  or  expectancy  table  would  result  in  over-prediction 
of  criteria  performance  for  the  Negro  sample. 
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b)  Six  illustrations  of  Model  1  were  illustrated  in  the  relationship 

l 

between  the  California  Test  of  Mental  Maturity  and  the  criterion  Manures. 

White  students  scored  higher  on  both  the  predictor  and  criterion  and  the  test 
vas  valid  for  both  racial  groups.  In  this  model  the  difference  in  predictor 

j 

performance  is  associated  with  a  difference  in  criterion  performance,  thus  j 

bias  in  the  test  is  not  present  for  predicting  this  criterion. 

c)  Only  one  other  model  was  represented  in  the  ninth  grade  sample. 

Model  2  was  illustrated  in  the  relationship  between  the  California  Test  of 

i 

I 

Mental  Maturity  and  Science  grades.  Although  there  was  a  difference  between 
the  Man  performance  of  the  two  racial  groups  on  the  predictor,  there  was  no 
difference  in  the  criterion.  The  test  was  valid  for  both  racial  groups.  The 
use  of  a  single  regression  line  would  result  in  over-prediction  of  criterion 
performance  for  vhlte  students  and  under-prediction  for  Negro  students. 

The  regression  tests  for  the  analysis  of  covariance  (Potthoff,  1969)  are 
presented  in  Table  13 .  This  analysis  Indicated  that  there  was  no  difference 
in  the  slope  of  the  regression  line  for  the  two  racial  groups  in  any  of  the 
predictor-criterion  relationships.  However,  in  all  but  six  of  the  relation¬ 
ships  there  was  a  difference  in  the  intercepts  for  the  two  racial  groups. 

Models  Illustrated  -  Twelfth  Grade 

Two  models.  Model  1  and  Model  6,  occurred  quite  frequently  in  the  twelfth  I 

grade  sample.  j 

a)  Model  1  is  illustrated  in  the  relationship  between  the  Grade  Point 
Average,  Leadership,  and  Creativity  Keys  of  tt  »  Alpha  and  the  following 
criteria:  Rank  in  Class,  Science  grades.  Grade  Point  Average,  and  ratings 
of  Leadership  and  Creativity.  In  all  illustrations  of  this  model  white  students 
scored  higher  than  Negro  students  on  both  the  predictor  and  criterion  and  the 
test  was  valid  for  both  racial  groups.  Since  the  difference  in  predictor 
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performance  is  paralleled  by  a  corresponding  difference  in  criterion  performance 
the  test  is  not  biased  against  either  subgroup. 

b)  Model  8  is  illustrative  of  the  situation  where  there  is  a  difference 
in  both  predictor  and  criterion  performance  but  the  test  is  valid  only  for  one 
subgroup.  This  Model  was  illustrated  in  the  relationship  between  the  Grade 
Point  Average,  Leadership,  and  Creativity  keys  of  the  Alpha  Biographical  Inven¬ 
tory  and  English,  math,  and  social  studies  grades,  as  well  as  Grade  Point 
Average.  In  all  illustrations  of  this  model  the  test  is  valid  only  for  the 
white  group,  and  white  students  score  higher  on  both  the  predictor  and  criteria. 
Valid  predictions  can  be  made  since  the  white  group  scored  higher  on  both  the 
predictor  and  criterion.  However,  the  test  is  clearly  not  appropriate  for  the 
Negro  sample. 

c)  Two  other  models  were  also  illustrated  in  the  twelfth  grade  sample. 

The  relationship  between  the  Bank  in  Class  key  of  the  Alpha  Biographical  Inven¬ 
tory  and  the  criteria,  Rank  in  Class,  science  grades,  Grade  Point  Average,  as 
well  as  the  Leadership  and  Creativity  ratings  illustrate  Model  3.  There  was 

no  difference  in  the  mean  scores  of  the  two  groups  on  the  biographical  key  but 
white  students  obtained  higher  grades  and  ratings.  Although  the  biographical 
key  is  valid  for  both  racial  groups,  the  use  of  a  single  regression  line  would 
result  in  over-prediction  of  the  criterion  for  Hegro  students  and  under-predic¬ 
tion  for  white  students. 

d)  The  last  model  observed  in  this  sample  was  Model  6  as  illustrated  in 
the  relationship  between  the  Rank  in  Class  biographical  key  and  math  and  social 
studies  grades.  There  is  no  difference  in  the  mean  scores  of  the  two  groups  on 
the  biographical  key  but  white  students  obtained  higher  grades.  The  key  is 
valid  only  for  the  white  sample.  Although  this  relationship  is  not  likely  to 
provide  any  differential  selection  rates  for  the  two  groups,  the  test  is  not 
appropriate  as  a  predictor  for  the  Negro  sample. 
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Table  (8»C)  presents  the  regression  tests  for  the  analysis  of  covariance 
( Fotthof f ,  1969)  for  the  twelfth  grade  sample.  Differences  in  the  stage  of 
the  regression  line  for  the  two  groups  were  observed  In  sixteen  predictor  -  criterion 
relationships.  Likewise,  differences  in  intercepts  for  the  two  groups  were  also 
observed  in  sixteen  relationships. 

One  must  conclude  from  the  results  of  the  twelfth  grade  sanple,  that 
using  biographical  data  as  predictors  does  not  necessarily  reduce  Dean  dif¬ 
ferences  for  different  ethnic  groups.  In  this  sample  only  one  biographical 
key  yielded  nonsignificant  differences  in  mean  scores  for  the  two  racial  groups. 

In  addition,  the  results  from  the  ninth  grade  sanple  illustrate  that  ellainatlon 
of  racial  differences  in  predictor  performance  does  not  always  reduce  test  bias. 

In  fact  if  there  exists  a  racial  difference  in  criterion  performance  reduction 
of  differences  in  mean  predictor  performance  actually  produces  test  bias. 


Table  13:  Analysis  of  Covarance  for  Homogeneity  of  Regression 

Iflnth  Grade 


Predictor 


Criterion 

C.T.M.M.  1  G.P.A.  Key 

1  *1^  f2^ 

F3^)  Fx  P2  |  F3 

Grades 

Bngllsh 

6.16** 

(2,197) 

3«k3 

(1,197) 

8.78** 

(1,196) 

lk. 03** 
(2,215) 

•  k2 

(1,215) 

27.71** 

(1,216) 

Math 

•  91 

(2,197) 

.67 

1,197) 

1.15 

(1,198) 

k.78**  I 
(2,215)  | 

1.19 

(1,215) 

8.36** 

(1,216) 

Science 

.10 

(2,197)  1 

.03 

1,197) 

.18 

(1,198) 

2.2k 

(2,215) 

1.78 

(1,215) 

2.69 

(1,216) 

Social 

Studies 

1  2.83 
(2,197) 

3-83 

(1,197) 

1.82 

(1,198) 

5.96** 

(2,215) 

l.kl 

(1,215) 

10. k8** 
(1,216) 

Average 

Grade  Point 

1.93 

(2,197) 

1.83 

(1,197) 

2.02 

(1,198) 

8.87** 

(2,215) 

.29 

(1,215) 

17.51** 

(1,216) 

Teacher  Rating 

Leadership 

2.02 

(2,198) 

•  3k 

(1,198) 

3*71 

(1,199) 

lk.88** 

(2,217) 

1.90 

(1,217) 

27.75** 

(1,218) 

Creativity 

m 

1.16 

(1,199) 

9-53** 

(2,217) 

1.90 

(1,217) 

27-75** 

(1,218) 

Criterion 

Predictor 

Alpha  Biographical  Inventory 

Leadership  Key  1  Creativity  Key 

1  'l  *2  1  '3  1  *1  1  *2  1  F3 

Grades 

English 

Ik. 67** 
(2,215) 

1.6k 

(l,215) 

1 

27.62** 

(1,216) 

lk.75** 

(2,215) 

iHffll 

28.06** 

(1,216) 

Table  13  (Contd) 
Ninth  Grade 


Criterion 

Predictor 

1 

Alpha  Biographical  Inventory 

Leadership  Key  l  Creativity  Key 

F1 

F2  1  f3  1 

1  F1 

1  f2  I 
> 

F3 

Math 

5.31* ** 

(2,215) 

1.30 

(1,215) 

9.30** 

(1,216) 

5.22** 

(2,215) 

1.02 

(1,215) 

9.43** 

(1,216) 

Science 

1.98 

(2,215) 

.37 

(1,215) 

3-59 

(1,216) 

2.05 

(2,215) 

.40 

(1,215) 

3-71 

(1,216) 

Social 

Studies 

1 

6.99** 

(2,215) 

2.5^ 

(1,215) 

11.36** 

(1,216) 

6.91** 

(2,215) 

2.2 

(1,215) 

11.63** 

(1,216) 

Grade  Point 
Average 

9.k2** 

(2,215) 

1.13 

(1,215) 

17.70** 

(,216) 

9.57** 

(2,215) 

•  90 

(1,215) 

18.24** 

(1,216) 

Teacher  Ratings 

Leadership 

16.85** 

(2,217) 

1-96 

(1,217) 

31.62** 

(1,218) 

16.82** 

(2,217) 

2.13 

(1,217) 

31.3*** 

(2,217) 

Creativity 

10.99** 

(2,217) 

•  59 

(1,217) 

21.44** 

(1,218) 

11.15** 

(2,217) 

.61 

(1,217) 

21.72** 

(2,217) 

*  P  (  -05 

**  p  ‘  .01 

(1)  Pi  teats  hypothesis  that  £(7^1  Xjj)  =  a  ♦  bXjj  for  all  1  groups 

(2)  ?2  tests  hypothesis  that  F( j  |  Xjj)  =  aA  ♦  bX^  for  all  1  groups 

(3)  P3  tests  hypothesis  that  FfY^J  Xjj)  =  a  +  bjX^j  for  all  1  groups 

(valid  test  only  If  P  is  not  significant) 


J 
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Table  lb:  Analysis  of  Covariance  for  Homogeneity  of  Regression 


Twelfth  Grade 


Predictor 

Alpha  Biographical  Inventory 

Criterion 

C.T.M.M. 

1  Rank  in  Class  Key  1 

>x(1) 

p2(£) 

,3<3) 

F1 

m 

Rank  in  Class 

5.71** 

(2,235) 

10.60** 

(1,235) 

.78 

(1,236) 

7  " 

11.26** 

(2,295) 

1.37 

(1,295) 

21.12** 

(1,296) 

Grades  ] 

English 

8.11** 

(2,23b) 

16.21** 

(1,23*) 

.01 

(1,235) 

18.22** 

(2,295) 

8.76“ 

(1,2951 

26.95 
( ,2 96) 

Math 

ma 

E22H 

|2gg| 

b.bO 

(l,2bl) 

Science 

1.09 

(2,135) 

2.18 

(1,135) 

.01 

(1,136) 

9.63** 

(2,172) 

2.72 
( 1,172 J 

16.37** 

(1,173) 

Social 

Studies 

6.57** 

(2,230) 

10.30** 

(1,230) 

2.73 

(1,231) 

11. bO** 
(2,289) 

11. 19*** 
(1,289) 

11.21 

(1,290) 

Average  Grade 
Point 

6.b6** 

(2,235) 

12.70** 

(1,235) 

.21 

(1,236) 

18.88** 

(2,296) 

10.65** 

(1,296) 

26.26 

(1,297) 

Teacher  Ratings 

leadership 

1.81 

(2,238) 

2.97 

(1,238) 

.65 

(1,239) 

9.42** 

(2,299) 

•  50 

(1,299) 

IB.36** 

(1,300) 

Creativity 

2.b7 

(2,238) 

b.52 

(1,238) 

.41 

(1,239) 

12.56** 

(2,299) 

.80  I 
(1,299) 

2b. 3b** 
(1,300) 
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Table  14  (Contd) 
Twelfth  Grade 


Predictor 


Alpha  Biographical  Inventory 


Criterion 

G 

.P.A.  Key 

1 

!  Leadership  Key 

j 

P1 

P2 

”3 

P1 

P2 

P3 

Rank  in  Class 

6.0C** 
(2,2 95) 

.72 

(1,295) 

HE* 

6.25** 

(2,295) 

BB£g| 

11.48** 

(1,296) 

Grades 


English 

7.64** 

(1,295) 

18.46 

(1,296) 

13.87** 

(2,295) 

8.67** 

(1,295) 

18.59 

(1,296) 

Math 

2.84 

(2,240) 

ESBRI 

3.^3 

(1,241) 

2.88 

(2,240) 

2.36 

(1,240) 

3.38 

(l,24l) 

Science 

6.69** 

(2,172) 

2.69 

(1,172) 

10.60** 

(1,173) 

6.93** 

(2,173) 

B2fS 

10.72** 

(1,173) 

Social 

Studies 

8.16** 

(2,289) 

10.61** 

(1,289) 

5-53 

(1,290) 

8.62** 

(2,289) 

n.57** 

(1,289) 

5.46 

(1,290) 

Average  Grade 
Point 

12. 40** 
(2,296) 

8.63** 

(1,296) 

15-77 

(1,297) 

12.96** 

(2,296) 

9.56** 

(1,296) 

15.90 

(1,297) 

Teacher  Ratings 

Leadership 

6.45** 

(2,299) 

.62 

(1,299) 

12.26** 

(1,300) 

6.50** 

(2,299) 

1  I 

12.27**  \ 

(1,300)  j 

Creativity 

9.26** 

(2,299) 

1.10 

(1,299) 

17.42** 

(1,300) 

9.28** 

(2,299) 

1.13 

(1,299) 

17.41** 

(1,300) 

Criterion _ Creativity  Key  j 

III  ' 


*1 

r 2 

P3 

Rank  in  Class 

6.31** 

(2,295) 

1.47 

(1,295) 

(1,296) 

Grades 

English 

13.61** 

(2,295) 

6.42** 

(1,295) 

18.34 

(1,296) 
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Table  Ik  (Con td) 


Twelfth  Grade 


Criterion 

Predictor 

Alpha  Biographical  Inventory 

Creativity  Key 

F1 

F2 

"3 

Math 

3-05 

3.16 

2.U1 

(2,2U0) 

(1,2U0) 

(l,24l) 

Science 

6.62** 

3-13 

10.00** 

(2,173) 

(1,172) 

(1,173) 

Social  Studies 

8.37** 

11.11** 

5.44 

(2,289) 

(1,289) 

(1,290) 

Average  Grade 

Point 

13-71** 

10.97** 

15.92 

(2,296) 

(1,296) 

(1,297) 

Teacher  Ratings 


Leadership 

6.41** 

.66 

12.17** 

(2,299) 

(1,299) 

(1,300) 

Creativity 

8.97** 

.82 

17.13** 

(2,299) 

(1,299) 

(1,  00) 

*  p  <  .05 

**  p  <  .01 

(1) 

(2) 

(3) 

0») 


*30 


Degrees  of  freedom  for  all  comparisons:  F  -(2,90);  ?2  -(1,920);  F^  (l,9l) 
F^  tests  hypothesis  that  F  (Y^JXjj)  »  a  ♦  bX^  for  all  i  groups. 

P2  tests  hypothesis  that  F  ( )  *  a^  ♦  bXjj  for  all  i  groups. 

F^  tests  hypothesis  that  F  (Y^jx^)  =  a  ♦  for  *  IP^P8* 

(valid  test  only  if  P2  is  not  significant) 


Study  #3:  An  Kxamination  of  the  Variables 
Predicting  Success  in  a  University  Setting 


Sample 

The  sample  consisted  of  students  attending  a  large  state  university. 

Since  predictor  data  vas  not  available  for  all  subjects,  the  saaple  sizes 
for  the  various  predictors  are  different.  The  Negro  group  consisted  of  all 
the  Negro  students  who  entered  the  university  in  September,  1968,  while  the 
white  group  vas  a  random  sample  of  the  remaining  students  who  entered  that 
semester. 

Predictors 

Predictors  included: 

a)  standard  academic  predictors  Scholastic  Aptitude  Test  -  Verbal  (SAT-V), 
Scholastic  Aptitude  Test  -  Mathematical  (SAT-M),  and  high  school  grade  point 
average; 

b)  non-intellectual  variables,  including  the  California  Psychological  Inven¬ 
tory  (CPI),  the  Holland  Vocational  Preference  Inventory  (HVPl) ■ 

c )  an  opinion  questionnaire  known  as  the  University  Student  Census  (USC). 

High  school  grade  point  average  was  a  standardized  score  havlrg  a  mean 
of  3*00  and  a  standard  deviation  of  1.00,  used  to  equate  different  high  school 
grading  systems.  The  CPI  consists  of  Id  scales,  with  an  anxiety  scale  developed 
by  Leventhal  (1966),  included  as  a  nineteenth  scale.  The  HVPI  consists  of  11 
scales,  and  the  USC  includes  17  statements  to  which  students  vere  asked  to 
nark  their  agreement  or  disagreement.  The  statements  of  the  USC  are  as 


follows : 


30.  Most  of  ay  courses  are  stimulating  and  exciting. 

30a.  The  University  should  actively  recruit  black  students. 

31*  Most  faculty  advisors  here  act  like  they  really  care  about  students. 

32.  Most  instructors  here  act  like  they  really  care  about  students. 

33.  Most  administrators  here  act  like  they  really  care  about  students. 

3k.  The  University  should  use  its  Influence  to  Improve  social  conditions  in 
the  state. 

35*  University  students  have  ample  opportunity  to  participate  in  University 
policy  aaklng. 

36.  I  aa  here  for  an  education;  let  other  people  "get  Involved"  on  caapus. 

37.  There  should  be  a  special  college  for  new  students  undecided  as  to  their 
major. 

38.  The  University  should  suspend  students  who  disrupt  the  normal  operation 
of  the  University. 

39.  Most  courses  require  intensive  study  and  preparation  outside  the  class- 
roan. 

kO.  Most  organised  student  activities  on  caapus  are  ridiculous, 
kl.  Major  University-wide  events  draw  lots  of  support  and  enthuslasa. 
k2.  There  are  aany  facilities  and  opportunities  on  caapus  for  individual 
creative  activities. 

k3.  At  the  beginning  of  a  course,  there  is  no  way  to  tell  who  will  get  what 
grades. 

kk.  Channels  for  expressing  student  complaints  are  readily  available. 
k5.  I  was  a  leader  in  high  school. 

Mean  predictor  scores  are  given  in  Table  15.  On  the  three  traditional 
academic  predictors,  the  Negro  subgroup  scores  were  significantly  lover  than 
the  acores  of  the  white  subgroup.  For  the  non-acadealc  predictors  it  becomes 
less  meaningful  to  speak  of  one  group  scoring  higher  than  another.  However, 
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Tab!*  15:  Predictors  -  Means,  Standard  Deviations,  M's, 
and  Tests  of  Significance  of  Mean  Differences 


Predictor 

Group 

X 

s 

1 

t<D 

SAT-V 

Total 

469-4 

97.2 

304 

White 

501.8 

95.4 

178 

Negro 

423.5 

80.0 

126 

1.50** 

SAT-M 

1 

Total 

White 

Msgro 

492.8 

533.1 

435-8 

106.5 

98.9 

89.9 

304 

178 

126 

8.74** 

High  School 

Total 

2. 99 

•  99 

304 

Grade  Point 

White 

3*17 

1.00 

178 

Average 

Msgro 

2.72 

•  92 

126 

3.63** 

California  Psychological 
Inventory 


Dominance 

Total 

25.42 

6.55 

272 

-  1 

White 

25.72 

6.72 

193 

Msgro 

24.67 

6.07 

79 

1.20 

Capacity  for 

Total 

17.47 

4.17 

272 

j 

Status 

White 

17.89 

4.17 

193 

Negro 

16.44 

4.03 

79 

2.62** 

Sociability 

Total 

■RM 

272 

White 

193 

Msgro 

■ 

Kfl 

79 

1.34 

Social 

Total 

33.21 

6.41 

272 

Presence 

White 

33.91 

6.63 

193 

Msgro 

31.52 

5.52 

79 

3.04** 

Self 

Total 

H- 

4.22 

272 

Acceptance 

White 

4.26 

193 

Msgro 

4.02 

79 

1.49 

Sense  of 

Total 

31.69 

6.75 

272 

Well-Being 

White 

32.22 

6.66 

193 

Msgro 

30.41 

6.85 

79 

2.01* 

Responsibility 

Total 

28.46 

5.44 

272 

White 

28.59 

5.50 

193 

Negro 

28.13 

5.31 

79 

O.63 

_ J 

-  —  —  _  —  —  — 
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Table  15  (Contd) 


Prediction 

Group  i 

x  ! 

1  s 

N 

t<D 

Socialization 

Total 

36.52 

6.89 

272 

White 

36.53 

7.05 

193 

Negro 

36.51 

6.50 

79 

0.02 

Self-Control 

Total 

24.46 

8.66 

272 

White 

24.09 

8.82 

193 

Negro 

25-37 

8.25 

79 

1.10 

Tolerance 

Total 

19.03 

5.23 

272 

White 

19.65 

5-19 

193 

Negro 

17.51 

5-05 

79 

2.20* 

Good 

Total 

14.42 

6.12 

27  2 

Impression 

White 

14.22 

6.27 

193 

Negro 

14.91 

5.74 

79 

0.96 

Commonality 

Total 

— 

3-41 

272 

White 

3-34 

193 

Negro 

HEE9I 

3-53 

79 

2.04* 

Achievement 

Total 

if 

5.28 

272 

via  Conference 

White 

5.45 

193 

Negro 

■9 

4.84 

79 

0.97 

Intellectual 

Total 

34.92 

6.4? 

272 

Efficiency 

White 

35-47 

6.19 

193 

Negro 

33-57 

7-04 

79 

2.20* 

Psychological 

Total 

9- 61 

2.92 

272 

Minded ness 

White 

9-93 

3-01 

193 

Negro 

9.49 

2.69 

79 

1.12 

Flexibility 

Total 

9.66 

4.04 

272 

White 

10.04 

4.12 

193 

Negro 

8.60 

3-74 

79 

2.30* 

Femininity 

Total 

20.36 

5.19 

272 

White 

20.34 

5.22 

193 

Negro 

20.42 

5.17 

79 

0.11 

Leventhal’ s 

Total 

5-41 

2.41 

272 

Anxiety  Scale 

White 

5.4o 

2.42 

193 

Negro 

5-42 

2.41 

79 

0.06 

Holland  Vocation  Performance 

Index 

Realistic 

Total 

mmmm 

mmam 

230 

j 

White 

1 

164 

Negro 

Hi 

66 

0.96 

Table  15  (Contd) 


Predictor _ I  Group  Ixl  s  1  If  1  t<D 


Intellectual 

Total 

White 

Hegro 

4.21 

4.06 

4.59 

4.18 

4.10 

4.36 

1 

230 

164 

66 

0.87 

Social 

Total 

5.10 

3-92 

230 

White 

4.82 

3.89 

164 

Hegro 

5.80 

3.93 

1 

66 

1.72 

Conventional 

Total 

2.04 

2.93 

230  i 

White 

1.59 

2.51 

164 

Hegro 

3.17 

3.55  j 

66 

3.28** 

Enterprising 

Total 

2.85 

2.93  1 

230  j 

White 

2.79 

2.87 

164 

Hegro 

3.00 

3.08 

66 

0.49 

Artistic 

Total 

4.19 

3.98 

230 

White 

3.90 

3.82 

164 

Hegro 

4.92 

4.31 

1 

66 

1.76 

Self  Control 

Total 

10.07 

3-59 

230 

White 

9.93 

3.50 

164 

Hegro 

10.41 

3.84 

66 

|  0.91 

Masculinity 

Total 

6.25 

3.09 

230 

White 

6.23 

3.11 

164 

Hegro 

6.30 

3.04 

66 

0.15 

Status 

Total 

8.29 

2.28 

230 

White 

8.19 

2.21 

164 

Hegro 

8.55 

2.47 

66 

1.07 

Infrequency 

Total 

6.01 

2.95 

230 

White 

5.78 

2.89 

164 

Hegro 

6.59 

3-03 

66 

1.89 

Acquiescence 

i  Total 

9.31 

5.03 

230 

White 

8.95 

4.52 

164 

Hegro 

10.21 

6.05 

66 

1.52 

University  Student  Census 

#30 

Total 

2.47 

1.08 

246 

White 

2.39 

1.00 

141 

Hegro 

[  2.58 

1.16 

105 

1.3* 

#30e 

Total 

1.64 

1.49 

250 

White 

2.34 

1.32 

144 

Hegro 

0.68 

1.13 

106 

10.39** 

Table  15  (Contd) 


Predictor 

Group 

X 

8 

H 

t<D 

#31 

Total 

wm 

1.18 

250 

White 

1  E 3 

1.11 

144 

negro 

1.25 

106 

1.70 

#32 

Total 

2.27 

1.13 

248 

White 

2.00 

0.93 

143 

negro 

2.63 

1.27 

105 

4.29** 

#33 

Total 

2.57 

wmwtm 

250 

White 

2.43 

■ 

144 

negro 

2.75 

IIEmI 

106 

2.10* 

#3* 

Total 

1.70 

HBE3H 

250 

White 

1.86 

144 

negro 

1.49 

106 

2.07* 

#35 

Total 

2.38 

1.32 

250 

White 

2.17 

1.17 

144 

negro 

2.66 

1.47 

106 

2.82** 

#36 

Total 

2.76 

— 

250 

White 

2.80 

144 

negro 

2.69 

BEfl 

106 

0.71 

#37 

Total 

1.98 

HI 

250 

White 

1.99 

144 

negro 

1.95 

IB 

106 

#38 

Total 

2.27 

1.34 

149 

White 

1.92 

1.32 

144 

negro 

2.73 

1.24 

105 

#39 

Total 

1.46 

1.14 

250 

White 

1.65 

1.12 

144 

negro 

1.22 

1.13 

106 

2.98** 

#4o 

Total 

2.69 

■QVjK 

250 

White 

2.68 

144 

negro 

2.70 

KB 

106 

0.14 

#41 

Total 

2.05 

1.26 

250 

White 

1.94 

1.11 

144 

negro 

2.20 

1.44 

106 

1.54 

#42 

Total 

1.67 

1.21 

250 

White 

1.53 

1.10 

144 

negro 

1.64 

1.34 

106 

1.94 

Table  15  (Contd) 


Predictor 

Group 

X 

8 

H 

tU) 

#43 

Total 

1.69 

1.33 

250 

White 

1.76 

1.30 

144 

Hegro 

1.59 

1.36 

106 

1.00 

#44 

Total 

2.3 2 

1.14 

249 

White 

2.25 

1.04 

144 

Hegro 

2.42 

1.27 

105 

1.12 

#45 

Total 

1.81 

mmm 

247 

White 

1.90 

■ 

142 

Hegro 

1.68 

■Sal 

105 

1.36 

(1)  t  ratios  are  between  the  means  of  the  white  and  Hegro  samples 
*  P  <  -05 

**  p  <  .01 
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there  were  significant  differences  between  the  racial  groups  on  eight  of 
the  nineteen  CPI  scales,  one  of  the  eleven  HVPI  scales,  and  seven  of  the 
seventeen  USC  items. 

In  the  eight  instances  vhere  mean  differences  were  found  on  the  CPI 
scales ,  the  mean  of  the  white  subgroup  was  of  greater  magnitude  than  the  mean 
of  the  Negro  subgroup.  The  scales  included  Capacity  for  Status,  Social 
Presence,  Sense  of  Well-Being,  Tolerance,  Comma  lit  y.  Achievement  via  Inde¬ 
pendence,  Intellectual  Efficiency,  and  Flexibility.  The  only  HVPI  scale  for 
which  a  mean  difference  was  found  was  the  Conventional  scale,  on  which  the 
Negro  subgroup  has  a  greater  mean  score  than  the  vhlte  subgroup.  In  four  of 
the  seven  instances  where  significant  differences  were  found  on  USC  items,  the 
white  subgroup  agreed  more  with  the  statement  than  the  Negro  subgroup.  These 
statements  were:  "Most  instructors  here  act  like  they  really  care  about 
students";  "Most  administrators  here  act  like  they  really  care  about  students"; 
"University  students  have  ample  opportunity  to  participate  in  University  policy 
making";  and  "The  University  should  suspend  students  who  disrupt  the  normal 
operation  of  the  University."  The  Negro  subgroup  agreed  more  than  the  vhite 
subgroup  with  the  following  statements:  "The  University  should  actively  re¬ 
cruit  black  students";  "The  University  should  use  its  influence  to  improve 
social  conditions  in  the  state";  and  "Most  courses  require  intensive  study 
and  preparation  outside  the  classroom." 

Criterion 

The  criterion  used  was  grade  point  average  at  the  end  of  the  freshman 
year.  Since  the  sample  sizes  were  substantially  different  for  the  various 
predictors,  a  separate  test  for  criterion  differences  was  made  for  each  of 
the  predictors.  In  every  case,  the  Negro  sample  scored  significantly  lower 
than  the  white  sample  on  the  criterion  measure.  The  criterion  data  for  each 
predictor  group  is  presented  in  Table  16. 


Table  16:  Criterion  -  Means,  Standard  deviations.  M's,  and 
Testa  of  Significance  of  Mean  Differences  for  each  Group 


of  Predictors 


Criterion 

!  Group 

X 

s 

N 

tU> 

Grade  Point  Average 

Total 

2.00 

.84 

304 

(Academic  Predictors) 

White 

2.22 

.68 

178 

Negro 

1.69 

.68 

126 

5.90** 

Grade  Point  Average 

Total 

2.06 

•86 

272 

(CPI  as  a  Predictor) 

White 

2.20 

.88 

193 

4.92** 

Negro 

1.70 

.70 

79 

Grade  Point  Average 

Total 

2.14 

.81 

230 

(HVPI  as  a  Predictor) 

White 

2.27 

.63 

164 

4.26** 

Negro 

1.82 

.67 

66  ! 

Grade  Point  Average 

Total 

2.21 

t 

.08 

251 

(USC  as  a  Predictor) 

White 

2.47 

.83 

144 

7.65** 

. 

Negro 

1.88 

•  59 

107 

t  ratios  are  between  the  means  of  the  vhite  and  Negro  samples. 

**  p  <.01 
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Validity 

Table  17  presents  the  validity  coefficients  for  the  total,  white,  and 
Negro  saaplea.  The  standard  academic  predictors  (SAT-V,  3AT-M,  and  high  school 
grades)  are  the  best  predictors  of  college  success  for  both  racial  groups.  The 
validity  coefficients  of  the  SAT's  are  of  greater  magnitude  for  the  Negro  sub¬ 
group,  but  high  school  grades  do  not  predict  as  well  for  the  Negro  group  as 
for  the  white  group. 

The  magnitude  of  the  non-intellectual  validity  coefficients  is  generally 
lover  than  the  academic  variables.  Nevertheless,  ten  of  the  nineteen  CPI 
scales  were  significantly  related  to  university  grade  point  average  for  the 
white  group;  and  seven  were  significantly  related  for  the  Negro  group.  Simi¬ 
larly,  five  HVPI  scales  reached  significance  for  the  white  sample,  and  three 
for  the  Negro  sample.  Of  the  seventeen  USC  items  only  three  exhibited  a  sig¬ 
nificant  relationship  for  the  Negro  group,  and  none  for  the  white  sample. 

Models  Illustrated 

In  twenty -two  of  tie  fifty  predictor-criterion  relationships  examined, 
the  predictor  was  not  valid  for  either  racial  group  or  for  the  total  group. 

The  remaining  cases  illustrated  the  following  Bartlett -O' Leary  models  (1969) : 

a)  There  were  six  cases  of  Model  1,  in  which  the  predictor  was  valid  for 
both  racial  groups  although  there  were  significant  mean  differences  on  both 
the  predictor  and  the  criterion.  Since  this  procedure  would  result  in  the 
selection  of  those  most  likely  to  succeed,  there  is  Justification  for  employ¬ 
ing  the  predictor  in  a  selection  decision.  All  three  standard  academic  pre¬ 
dictors  are  found  among  the  examples  of  Model  1.  However,  the  significant 
validity  coefficient  for  the  Negro  high  school  grade  point  average  is  much 
lower  than  that  obtained  for  the  white  sample  and  the  test  for  homogeneity  of 
of  regression  indicated  that  the  regression  slopes  were  different  for  the  two 
racial  groups. 
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Table  17  (Contd) 
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indicates  those  models  in  which  a  significant  difference  exists  between  the  validity  coefficients 
for  the  two  ethnic  groups. 


b)  There  ere  five  examples  of  Model  3  in  the  date:  three  CPI  scales 
and  two  HVPI  scales*  Although  these  scales  are  equally  valid  Id  the  racial 
groups,  the  significantly  lower  criterion  performance  of  the  Hegro  sample 
would  lead  to  a  selection  of  more  Negroes  who  would  fail  to  meet  criterion 
standards* 

c)  For  the  models  describing  differential  validity,  there  were  eight 
examples  of  Model  6  where  there  was  no  difference  for  the  predictor  scores 

but  one  subgroup  obtained  higher  criterion  scores,  and  four  examples  of  Model  8 
where  there  were  significant  subgroup  mean  differences  in  both  predictor  and 
criterion  performance  but  significant  validity  for  only  one  subgroup.  In 
seven  cases  the  predictor  was  valid  for  only  the  white  sample,  and  in  five 
cases  for  only  the  Negro  subgroup. 

d)  Five  examples  of  Model  11  were  found.  In  these  cases,  although  the 
predictor  appears  valid  vhen  looking  at  the  total  group,  it  has  no  validity  in 
either  racial  group. 

There  are  two  cases  which  illustrate  a  more  stringent  requirement  that  the 
validity  coefficients  of  the  racial  groups  be  significantly  different  for  iden¬ 
tification  of  differential  validity.  The  first  is  the  Infrequency  scale  of  the 
HVPI,  which  was  identified  as  a  case  of  Model  6  by  the  first  method  of  analysis. 
The  second  Instance  is  high  school  grade  point  average  which  was  identified  as 
Model  1  by  the  first  method  of  analysis.  Even  though  high  school  grades  are 
a  valid  predictor  (p  < .01)  for  both  racial  groups,  the  validity  coefficient 
is  sufficiently  lower  for  the  Negro  group  to  make  the  difference  in  coefficients 
significant  (p  ^  *05). 

Because  of  the  large  criterion  differences  between  the  Negro  and  white 
groups,  the  test  for  homogeneity  of  regression  indicates  a  separate  Intercept 
in  each  case  in  which  the  test  is  appropriate.  (See  Table  18) 


& 


Predictor  _ Criterion  <ffA 


Fx(2) 

F2(3) 

*3(4) 

SAT-V 

2.30 

.00 

4.61* 

8AT-M 

3.15* 

•13 

6.19* 

High  School 

12.57** 

8.84** 

15.89** 

GPA 

2,300) 

(1,300) 

(1,301) 

CPI 

Dominance 

9.48** 

.18 

I8.83** 

Capacity  for  Status 

8.14** 

.01 

16.32** 

Sociability 

9.73** 

.11 

19.41** 

Social  Presence 

11.37** 

1.79 

20.90** 

Self -Acceptance 
Sense  of  Well- 

9.71** 

.08 

19.42** 

Being 

8.57** 

•  13 

17.05** 

Responsibility 

9.85** 

.08 

19.67** 

Socialization 

10.90** 

•  53 

21.30** 

Self-Control 

11.45** 

.77 

22.15** 

Tolerance 

7.68** 

1.26 

14.10** 

Good  Impression 

10.51** 

.09 

20.99** 

Cm— nnslity 
Achievement  Via 

9.03** 

1.10 

16.96** 

Conf . 

11.51** 

.05 

23.05** 

Ach.  Via  Indep. 

7.13** 

.02 

14.29** 

Intellectual  Effic. 
Psychol.  Minded- 

8.0h** 

.08 

16.05** 

ness 

10.92** 

2.78 

18.94** 

Flexibility 

8.81** 

.11 

17.57** 

Femininity 
Leventhal's  Anx. 

10.30** 

.02 

20.65** 

Scale 

10.04** 

.14 

20.00** 

Holland 


Realistic 

8.07** 

1.50 

14.60** 

Intellectual 

8.72** 

1.16 

16.27** 

Social 

9.62** 

.04 

I9.2C** 

Conventional 

8.66** 

1.79 

15.48** 

Enterprising 

7.69** 

.10 

15.35** 

Artistic 

10.33** 

1.69 

18.91** 

Self-Control 

8.16** 

.76 

15.58** 

Masculinity 

7.94** 

.06 

15.88** 

Status 

9.14** 

•  99 

17.28** 

Infrequency 

12.04** 

9.14** 

14.43** 

Acquiescence 

8.35** 

.49 

16.24** 

(2,226) 

(1,226) 

(1,227) 
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Table  18  (Contd) 


Predictor 


Criterion  GPA. 


uac 

#30 

28.40** 

2.81 

#30a 

17.82** 

.05 

#31 

28.32** 

.18 

#32 

22.25** 

•31 

#33 

27.98** 

.36 

#3^ 

30.60** 

.75 

#35 

28.64** 

.18 

#36 

27.17** 

.02 

#37 

29.61** 

3*40 

#38 

28.97** 

•  91 

#39 

25.97** 

•  35 

#40 

27.96** 

•  59 

#41 

26.36** 

1.67 

#42 

26.76** 

.06 

#43 

28.03** 

1.19 

#44 

26.1b** 

.00 

#45 

26. 61** 

.00 

1 

1 

53-59** 

35.73** 


55.75** 


57.29** 
5 


55.27** 


49.72** 


53.70** 


(1)  Degrees  of  freedom  for  each  ratio  are  shown  in  parentheses 
below  each  column. 

(2)  Px  tests  hypothesis  that  EfY^  |  X±j )  *  a  ♦  b  for  all  i  groups. 

(3)  *2  te<rts  hypothesis  that  ECYij  |  Xjj)  =  a^  ♦  b  Xi(j  for  all  i  groups. 

(4)  tests  hypothesis  that  EfYj.  (  X^j)  *  a^  ♦  bA  X^j  for  all  i  groups 


Multiple  Regression  Analysis 

In  addition  to  the  zero-order  correlational  analysis,  a  multiple  regres¬ 
sion  analysis  was  performed  using  SAT-V ,  SAT-M,  and  high  school  grade  point 

f 

average.  The  results  of  this  analysis  are  found  in  Table  19*  Both  racial 
groups  appear  equally  predictable  using  the  multiple  predictor.  There  is  a 
difference  of  only  .00  between  the  cross-validated  coefficients  of  the  groups. 
Although  sample  size  did  not  permit  cross-validation,  multiple  regression 
analyses  were  performed  for  sexually  and  racially  divided  groups.  Table  19 
shows  the  multiple  R's  for  the  groups  to  be  very  similar,  ranging  from  .61 
to  .67. 
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Table  19:  Cross-validated  Multiple  R's 
Predictors:  SAT-V,  SAT-M,  and  High  School  Grade  Point  Average 


Multiple  R 

Cross-validated  R 

M 

Total 

.67 

.62 

152 

White 

.62 

.61 

89 

Megro 

.61 

•  59 

63 

White  Male 

.63 

79 

Megro  Male 

.61 

6k 

White  Female 

.65 

99 

Megro  Female 

.67 

62 

Megro  Female 


Study  #4:  The  Prediction  of  Work  Success  in  a 


1 

RB  ’■ 

l 

i 
? 

Health  Insurance  Company 

Sample  ] 

...  ...  j 

The  sample  consisted  of  209  workers  for  a  large  health  insurance  conp- 
any.  Of  the  209,  158  were  white  and  51  were  Negro.  The  employees  worked 
as  approvers,  coders,  keypunch  operators,  special  assistants,  and  computer 
operators. 

Predictors 

The  first  predictor  was  the  Thur stone  Test  of  Mental  Alertness  (TMA) 
which  provided  a  verbal,  quantitative,  and  total  score.  The  second  predictor, 
the  Pictorial  Reasoning  Test,  was  developed  by  Science  Research  Associates 
with  a  non-verbal  format  and  has  been  referred  to  as  a  culture-free  test.  Both 
tests  were  administered  to  new  employees  but  were  not  used  for  selection 
purposes. 

Significant  differences  in  performance  were  found  (p  <  .01)  between  the 
white  and  Negro  groups  on  the  verbal,  quantitative,  and  total  scales  of  the 
TMA,  but  no  significant  difference  was  found  on  the  Pictorial  Reasoning  Test. 

Predictor  means  are  presented  in  Table  20. 

Criteria 

All  seven  criteria  were  ratings  of  workers  by  their  supervisors  along  a 
five  point  scale.  Workers  were  rated  on  their  quantity  of  work,  quality  of 
work,  accuracy,  knowledge  of  their  Job,  Job  aptitude,  flexibility,  and  overall 
effectiveness.  The  mean  rating  for  the  white  group  was  significantly  (p  <  .05) 
higher  than  that  for  the  Negro  group  on  three  criteria:  quantity  of  work. 

Job  aptitude,  and  overall  effectiveness.  Criterion  means  for  the  total  group 
and  the  two  subgroups  are  shown  in  Table  21. 
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Table  20:  Predictors  -  Means,  Standard  Deviations,  If*  s, 
and  Tests  of  Significance  of  Mean  Differences 
Total  Insurance  Workers 


Predictor 

Group 

X 

s 

H 

t<o 

Thurstone  Test  of 

Mental  Alertness 

Verbal 

Total 

31-42 

10.92 

209 

White 

33*99 

9.78 

158 

Negro 

23.4? 

IO.38 

51 

6.55** 

Quantitative 

Total 

?2.6? 

8.82 

209 

White 

24.02 

e.49 

158 

Negro 

18.49 

8.50 

51 

4.02** 

Total 

Total 

c4 . 14 

18.55 

209 

White 

58.07 

17.03 

158 

Negro 

41.96 

17.73 

51 

5-79*» 

Pictorial 

Total 

46.64 

8.04 

209 

Reasoning  Test 

White 

4^  .02 

6.09 

158 

Negro 

45.4" 

7.77 

51 

1.20 

t  ratios  are  between  the  means  cf  the  white  and  Negro  samples. 
**  p  <  .01 
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Table  21:  Criteria  -  Means,  Standard  Deviations,  N*s, 
and  Tests  of  Significance  of  Mean  Differences 


Total  Insurance  Workers 


Criterion 

Group 

X 

s 

N 

t(D 

Quantity  of  Work 

Total 

3.46 

.68 

209 

White 

3-52 

.67 

158 

Negro 

3-25 

.69 

51 

2.47* 

Quality  of  Work 

Total 

3-53 

.75 

209 

White 

3-55 

.77 

158 

Negro 

3.43 

•  73 

51 

•  98 

Accuracy 

Total 

3.38 

.78 

209 

White 

3.41 

•78 

158 

Negro 

3-27 

.75 

51 

1.32 

Knowledge  of  Job 

Total 

3-58 

.68 

20 9 

White 

3.63 

.83 

158 

Negro 

3.49 

.81 

51 

1.65 

Job  Aptitude 

Total 

3-70 

.75 

208 

White 

3-75 

.73 

157 

Negro 

3.49 

.81 

51 

2.14* 

Flexibility 

Total 

3.16 

.89 

208 

White 

3-22 

.89 

157 

Negro 

2.98 

.86 

51 

1.68 

Overall 

Total 

5-53 

.74 

208 

Effectiveness 

White 

3-59 

.76 

157 

Negro  1 

3.33 

.68 

51 

2.17* 

t  ratios  are  betveen  the  means  of  the  white  and  Negro  samples. 
*  P  <-05 
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Validity 

Predictor-criterion  correlations  are  presented  in  Table  22.  Only  one  re¬ 
lationship  reached  significance  (p  <  .05)  for  the  Negro  sample,  vhile  eight 
correlations  were  significant  for  the  white  subgroup.  The  correlation  trends 
for  the  two  groups  were  different  with  25  of  28  correlations  for  the  Negro  group 
being  negative  correlations  while  all  the  correlations  for  the  white  group  were 
positive.  Quantity  of  work  was  the  most  predictable  criterion  for  the  white 
group.  All  four  test  scales  correlated  significantly  (p  < .05)  with  the  ratings 
on  that  measure.  Two  of  the  TMA  scales  were  significantly  related  to  quality  of 
work  and  flexibility  for  the  white  group.  The  only  significant  relationship 
found  for  the  Negro  group  was  between  the  Pictorial  Reasoning  Test  and  ratings 
of  overall  effectiveness.  Although  the  Pictorial  Reasoning  Test  tends  to  mini* 
mlze  the  difference  in  mean  performance  between  the  white  and  Negro  groups,  its 
validity  is  no  higher  than  that  of  the  TMA. 

Models  Illustrated 

a)  Three  examples  of  Model  8  were  Illustrated  in  the  prediction  of  quantity 
of  work  from  TMA  scales.  There  is  a  significant  mean  difference  on  both  the 
predictor  and  the  criterion,  with  a  significant  validity  coefficient  for  only 
the  white  group.  If  the  TMA  were  used  to  select  for  the  criterion,  quantity  of 
work,  these  examples  of  Model  8  would  not  lead  to  unfair  discrimination  since 
the  mean  differences  fall  on  both  the  predictor  and  the  criterion.  However,  the 
low  validity  coefficients  preclude  any  practical  use  of  the  predictors  even 
though  they  are  significant  for  the  total  group. 

b)  Three  examples  of  Model  7  were  found.  The  TMA -Quantitative  scale  and 
the  TMA-Total  scale  are  valid  predictors  of  quality  of  work  for  the  white  group, 
and  the  TMA-Total  scale  is  a  valid  predictor  of  flexibility  for  the  white  group. 
However,  none  of  these  correlations  are  significant  for  the  Negro  group.  Moreover, 
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Table  22:  Predictor -Criterion  Correlations 


Total  Group,  Whites,  and  Negroes 
Total  Insurance  Worker 2  ^ 


_ Predictor _ 

Thur stone  Test  of  Mental  Alertness 
||  ||  Pictorial  | 


Criterion 

Verbal 

i  Quantitative 

Total 

Reasoning 

N 

Quantity 

Total 

15* 

11** 

15* 

11 

209 

of  Work 

White 

16* 

17* 

18* 

18* 

158 

Negro 

-13 

-13 

-11* 

-19a 

51 

(8) 

(8) 

(8) 

(6) 

Quality 

Total 

08 

10 

09 

03 

209 

of  Work 

White 

13 

18* 

16* 

09 

158 

Negro 

-18 

-19a 

-20a 

-20 

51 

(7) 

(7) 

Accuracy 

Total 

01 

01 

01 

-01 

209 

White 

02 

00 

01 

00 

158 

Negro 

-15 

-06 

-32 

-09 

51 

Knowledge 

Total 

02 

01* 

03 

oi* 

209 

of  Job 

White 

03 

09 

06 

13a 

158 

Negro 

-17 

-20 

-20 

-22 

51 

Job 

Total 

05 

06 

06 

-03 

208 

Aptitude 

White 

06 

09 

07 

01 

158 

Negro 

-19 

-16 

-19 

-21 

51 

Flexibility 

Total 

16* 

13 

16* 

Oh 

208 

White 

17* 

11* 

16* 

06 

157 

Negro 

00 

01 

01 

-08 

51 

(7) 

(7) 

Overall 

Total 

09 

08 

09 

-01 

208 

Effective- 

White 

11a 

11 

11a 

05a 

157 

ness 

Negro 

-2l» 

-18 

-23 

-28* 

51 

(1)  Decimals  are  omitted. 

(2)  Number  in  parentheses  below  the  correlation  for  the  Negro  sample  (6) 
indicates  the  model  illustrated  (see  Appendix  A). 

*  p  <  .os 

a  indicates  significant  difference  between  white  and  Negro  correlations. 


since  Negroes  score  significantly  lover  on  the  predictors  while  there  were  no 
differences  on  the  criterion,  the  use  of  the  IMA  as  a  selection  Instrument 
would  be  unfair  to  tne  Negroes. 

c)  There  were  two  examples  of  Model  6  where  there  were  no  significant  dif¬ 
ferences  on  the  predictor,  but  the  white  group  scored  significantly  higher  on 
the  criterion.  In  one  case,  the  predictor  (Pictorial  Reasoning  Test)  was  valid 
for  the  white  group  alone;  and  in  one  case,  for  the  Negro  group  alone. 

Applying  the  more  stringent  requirement  that  a  significant  difference  exist 
between  the  Negro  and  vhlte  correlations,  four  of  the  nine  models  described  above 
remain.  A  significant  difference  between  Negro  and  white  correlations  was  found 
in  two  other  instances,  but  in  both  cases  the  predictor  was  inappropriate  since 
it  had  not  correlated  with  the  criterion  in  either  subgroup. 

The  results  of  the  analysis  of  covariance  for  homogeneity  of  regression, 
given  in  Table  23,  indicated  that  bias  exists  in  nine  instances  (as  determined 
by  a  significant  Fj  statistic).  Four  of  these  instances  involved  the  prediction 
of  quantity  of  work;  three  involved  prediction  of  knowledge  of  job;  and  two  in¬ 
stances  were  found  in  the  prediction  of  overall  effectiveness.  The  test  Indicated 
that  separate  regression  slopes  were  appropriate  in  six  instances:  the  prediction 
of  quantity  of  work  by  TMA-Q,  1MA-T,  and  the  Pictorial  Reasoning  Test;  the  pre¬ 
diction  of  quality  of  work  by  the  TMA-Q  and  the  TMA-T;  and  the  prediction  of  know¬ 
ledge  of  Job  by  the  Pictorial  Reasoning  Test.  In  the  prediction  of  Job  aptitude 
by  the  TMA-V,  one  F^  statistic  was  significant  indicating  that  separate  inter¬ 
cepts  are  appropriate  for  the  two  racial  groups. 

Cultural  Deprivation 

Since  the  IMA  is  assumed  to  be  culture  bound  and  the  Pictorial  Reasoning 


Test  purports  to  be  a  culture-free  test,  there  is  an  opportunity  to  compare  the 
predictability  of  the  two  tests  in  various  population  subgroups.  If  the  criterion 


Table  2 5  (Contd) 


P  <-01 


measures  can  be  assuned  to  be  culture-free,  then  the  Pictorial  Reasoning  Test 
should  demonstrate  superior  validity,  especially  in  culturally  deprived  groups. 

Two  measures  were  tested  as  indicators  of  cultural  deprivation.  First, 
analyses  were  made  for  the  two  racial  subgroups.  A  second  measure  of  cultural 
deprivation  suggested  by  Guion  (1966)  compares  in  standard  score  units,  the 
culture -bound  test  and  the  culture-free  test.  The  degree  of  discrepancy  between 
the  two  could  prove  to  be  a  measure  of  cultural  deprivation.  This  measure  could 
be  used  either  as  a  moderator,  or  as  a  predictor  itself.  Culturally  advantaged 
and  culturally  deprived  groups  were  thus  identified  by  the  difference  between  the 
1MA-T  and  the  Pictorial  Reasoning  Test.  Those  with  a  positive  difference  score 
vere  considered  culturally  advantaged,  while  those  with  a  negative  difference 
score  were  considered  culturally  deprived.  Positive  and  negative  difference 
score  groups  were  identified  within  both  the  white  group  and  the  Negro  group. 

Table  2h  presents  the  validity  coefficients  for  the  TMA-T  and  the  Pictorial 
Reasoning  Test  for  each  of  the  groups.  It  appears  that  the  TMA  was  more  highly 
correlated  with  criterion  measures  for  the  culturally  advantaged  groups.  Three 
out  of  seven  correlations  were  significant  for  the  white  group  while  none  were 
significant  for  the  Negro  group.  Five  correlations  reached  significance  in  the 
total  group  with  positive  difference  scores  (1WA-T  minus  Pictorial  Reasoning 
Test)  while  none  were  significant  for  the  total  group  with  negative  difference 
scores.  The  TMA  correlated  significantly  with  six  criteria  in  the  white  positive 
group  while  none  were  significant  in  the  white  negative,  and  with  one  in  the 
Negro  positive  while  none  were  significant  in  the  Negro  negative  group. 

The  Pictorial  Reasoning  Test  exhibited  little  validity  for  any  of  the  popula 
tion  subgroups.  Of  77  relationships  only  four  reach  significance.  Three  of 
those  four  are  found  in  groups  which  would  be  identified  as  culturally  advantaged 
Thus,  even  if  a  culture-bound  test  is  inappropriate  for  culturally  deprived 
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Table  2k  :  (Contd) 


P  <  -01 


individuals,  the  use  of  a  culture-free  test  for  those  individuals  did  not  im¬ 
prove  predictability. 

Moreover,  the  relationship  between  the  IMA  and  the  criterion  ratings  appears 
to  be  positive  for  culturally  advantaged  groups  and  negative  for  culturally  de¬ 
prived  groups.  Twenty-one  out  of  28  relationships  between  the  IMA  and  criterion 
measures  were  positive  for  culturally  advantaged  groups,  while  23  out  of  28  were 
negative  for  culturally  deprived  groups.  In  fact,  all  of  the  cases  In  which  the 
IMA  was  negatively  related  to  criterion  measures  in  a  culturally  advantaged 
group  were  in  the  Negro  group  with  positive  difference  scores. 

The  same  pattern  was  found  for  the  Pictorial  Reasoning  Test.  In  the  cultur¬ 
ally  advantaged  groups  21  out  of  28  correlations  were  positive,  while  20  out  of 
28  were  negative  in  the  culturally  deprived  groups.  Again,  all  seven  Instances 
of  a  negative  correlation  of  the  TMA  with  criterion  ratings  in  a  culturally  ad¬ 
vantaged  group  occurred  in  the  Negro  group  with  positive  difference  scores. 

The  pattern  of  relationships  is  similar,  regardless  of  whether  race  or  dif¬ 
ference  score  is  used  to  measure  cultural  deprivation.  In  fact,  the  positive 
difference  score  group  was  composed  of  only  13  Negroes  and  86  whites.  The  nega¬ 
tive  difference  score  group  consisted  of  38  Negroes  and  71  whites.  Hence  the 
majority  of  individuals  of  both  races  receive  the  same  classification  regardless 
of  the  cultural  deprivation  measure. 

Gulon  (1986)  further  proposes  that  cultural  deprivation  itself,  as  measured 
by  a  difference  score,  may  be  related  to  Job  success.  Table  25  presents  the 
correlations  between  the  difference  scores  and  criteria.  Since  only  two  out  of 
63  correlations  reach  significance  (p  <  *05),  there  was  virtually  no  relationship. 

Multiple  Regression 

The  multiple  regression  coefficients  for  each  of  the  criteria,  using  all 
three  TMA  scales  as  well  as  the  Pictorial  Reasoning  Test  and  the  difference  score 
as  predictors,  are  found  in  Table  26  along  with  the  cross-validated  coefficients. 
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Table  25:  Difference  Score  -  Criterion  correlations' 


P  <  .05 


Table  26:  Multiple  Regression  Coefficients  Predictors: 
THA-V,  TMA-Q,  1WA-T,  Pictorial  Reasoning, 
Difference  Score  ('MA-'j)  ~  Pictorial  Reasoning) 


Criterion 

Multiple  R 

SE 

Cross-validated  R 

N 

Quantity  of 

Work 

.23 

•73 

-.42** 

104 

Quality  of 

Work 

.19 

.71 

.03 

104 

Accuracy 

.44 

•  73 

•  34** 

104 

Knowledge  of 

Job 

•  52 

.62 

-.07 

104 

Job  Aptitude 

.1*8 

•75 

-.09 

104 

Flexibility 

.41 

.86 

.29** 

104 

Overall 

Effectiveness 

.46 

•73 

.38** 

104 

**  p  <  .01 
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The  original  sample  used  Included  one  half  of  the  total  group  (if-lOk).  Cross* 
validation  was  performed  in  the  other  half  of  the  sample.  ft>e  shrinkage  of  the 
multiple  R  made  multiple  prediction  Impractical  for  three  criteria:  quality  of 
work;  knowledge  of  job,  and  Job  aptitude.  Moreover,  the  instability  of  the 
weights  made  prediction  of  quantity  of  work  infeasible.  The  cross-validated 
coefficients  are  significant  (p  <  .01)  in  the  prediction  of  accuracy  (R*.3*0, 
flexibility  (R».29),  and  overall  effectiveness  (R*. 38) . 


Study  #5i  Prediction  of  Success  for  Clerical  Workers 
of  a  Health  Insurance  Company 


Sample 

The  sample  consisted  of  a  more  homogeneous  subgroup  of  126  clerical  workers 
in  the  health  insurance  company  used  in  the  previous  study.  Ninety-five  workers 
were  white,  and  31  vere  Negro.  All  employees  included  in  the  sample  were  employed 
as  clerks  or  clerk- typists. 

Predictors 

The  same  predictors  employed  for  the  total  group  of  health  insurance  workers 
in  the  previous  study  were  analyzed  for  this  sample.  White  clerical  vorkers  scored 
significantly  higher  than  Negro  vorkers  on  all  three  scales  of  the  IMA  (p  ^  .01). 
However,  there  was  no  significant  difference  between  the  two  ethnic  groups  on  the 
Pictorial  Reasoning  Test.  Predictor  means  are  presented  in  Table  27. 

t 

Criteria 

Criterion  ratings  along  the  same  dimensions  employed  in  the  previous  study 
were  used  as  criteria.  Although  the  mean  Job  performance  ratings  for  the  white 
group  were  higher  than  for  the  Negro  group  on  all  seven  criterion  scales,  the 
difference  was  significant  on  only  one,  Job  aptitule  (p  {  .05).  Mean  criterion 
scores  are  presented  in  Table  2 8. 

Validity 

Neither  the  IMA  nor  the  Pictorial  Reasoning  Test  was  a  valid  predictor  for 
either  racial  group.  Of  the  84  predictor-criterion  correlations  presented  in 
Table  29  only  one  reached  significance  (p  <  .05).  A  larger  number  of  significant 
correlations  could  be  expected  on  the  basis  of  chance  factors  alone.  However,  a 


Table  27  J  Predictors  -  Means,  Standard  Deviations,  N’s, 
and  Tests  of  Significance  of  Mean  Differences 
Clerical  Workers 


M 

Predictor 

X 

8 

If 

Tbour  stone  Test  of 
Mental  Alertness 


Verbal 

Total 

29.67 

9.44 

226 

White 

32.02 

8.56 

95 

Negro 

22.45 

8.38 

31 

5.39** 

Qiantitative 

Total 

20.89 

7.65 

126 

White 

22.00 

7.68 

95 

Negro 

17.48 

6.56 

31 

2.92** 

Total 

Total 

50.56 

15.31 

126 

White 

5l». 02 

14. 34 

95 

Negro 

39-94 

13.34 

31 

4.79** 

Pictorial 

1*6.77 

7.78 

126 

Reasoning  Test 

1 

1*6.72 

8.29 

95 

■  1 

*♦6.39 

6.63 

31 

0.22 

(l)  t  ratios  are  between  the  means  of  the  white  and  Megro  samples. 

**  p  <  .01 
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Table  26:  Criteria  -  Means,  Standard  Deviations,  It's, 


and  Tests  of  Significance  of  Mean  Differences 
Clerical  Workers 


Criterion 

mm 

X 

8  1 

N 

td> 

Qiantlty  of  Work 

Total 

3-45 

.69 

126 

White 

3-51 

.66 

95 

Negro 

3 .26 

.73 

31 

1.77 

Qiality  of  Work 

Total 

3.46 

.78 

126 

White 

3-51 

.77 

95 

Negro 

3-32 

•79 

31 

1.18 

Accuracy 

Total 

3.40 

.78 

326 

White 

3-47 

.77 

95 

Negro 

3-23 

.80 

31 

1.48 

Knowledge  of 

Job 

Total 

3-56 

.72 

126 

White 

3.63 

.65 

95 

Negro 

3-39 

.88 

31 

1.38 

Job  Aptitude 

Total 

3.69 

.82 

126 

White 

3.78 

.77 

95 

Negro 

3.42 

.92 

31 

2.14* 

Flexibility 

Total 

3.13 

•  92 

126 

White 

3-18 

•93 

95 

Negro 

2.97 

.88 

33-  | 

1.15 

Overall 

Effectiveness 

Total 

3-51 

.77 

126 

White 

3-57 

•  75 

95 

Negro 

3-29 

.78 

31 

1.78 

1 

i 


i 


(l)  t  ratios  are  between  the  means  of  the  white  and  Negro  samples. 
*  P  < -05 
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Table  29:  Predictor  -  Criterion  Correlations  - 


Total  Group,  Whites  and  Negroes 
Clerical  Workers^  1 2>  2>i 
_ Predictor  _ 


Thur stone  Test  of  Mental  Alertness _ 

Pictorial 


Criterion 

Verbal 

1  Quantitative 

Total 

Reasoning 

N 

Quantity  of 

Total 

05 

05 

06 

07 

126 

Work 

White 

05 

10 

08 

16 

95 

Negro 

-22 

-28 

-30 

-22 

31 

Quality  of 

'  "1 

Work 

Total 

04 

08 

07 

07 

126 

White 

09 

17a 

lha 

15« 

95 

Negro 

-27 

-36* 

(7) 

-35 

-31 

31 

Accuracy 

Total 

-03 

-08 

-06 

-08 

126 

White 

-05 

-10 

-08 

-06 

95 

Negro 

-26 

-19 

-25 

-23 

31 

Knowledge 

of  Job 

Total 

-03 

-01 

-02 

05 

126 

White 

-07 

05 

-01 

15a 

95 

Negro 

-17 

-3U 

-28 

-27 

31 

Job  Aptitude 

Total 

+07 

06 

07 

-09 

126 

White 

06 

10 

09 

-06 

95 

Negro 

-23 

-23 

-26 

-23 

31 

Flexibility 

Total 

07 

01 

05 

02 

126 

White 

07 

03 

06 

05 

95 

Negro 

-11 

-18 

-16 

-10 

31 

Overall 

Effectivenes 

1  Total 

07 

02 

05 

-ou 

126 

White 

09 

05 

08a 

03 

95 

Negro 

-29 

-30 

-33 

-26 

31 

(1)  Decimals  are  omitted 

(2)  Number  in  parentheses  below  the  correlation  for  the  Negro  sample 
Indicates  the  model  illustrated  • 

a  Indicates  significant  difference  between  white  and  Negro  correlations. 
*  P  <  .05 
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consistent  negative  relationship  between  the  scores  on  both  tests  and  the  criteria 
was  found  for  the  Negro  sample.  All  28  predictor-criterion  correlations  were 
negative  for  the  lfegro  group,  while  21  of  28  correlations  were  positive  for  the 
white  group.  Moreover,  the  culture-free  examination  was  not  more  valid  than  the 
IMA  for  either  racial  group. 

Models  Illustrated 

These  data  in  this  study  provide  an  example  of  Model  7  where  there  is  valid¬ 
ity  for  only  one  subgroup.  The  relationship  between  the  ratings  of  quality  of 
work  and  the  TMA  -  Qiantltative  scale  presented  in  Table  30  Indicate  that  the 
white  subgroup  scored  higher  on  the  test  but  the  test  was  only  valid  for  the 
Negro  subgroup.  Since  an  increase  in  the  predictor  score  was  not  associated  with 
an  increase  in  the  criterion  rating,  the  test  is  not  appropriately  utilized  in 
this  instance.  Actually,  a  better  predictor  score  for  the  Negro  subgroup  is  indic¬ 
ative  of  poorer  job  performance.  This  illustration  meets  the  additional  criterion 
of  a  significant  difference  between  validity  coefficients. 

Table  31  presents  the  results  of  the  analysis  of  covariance  for  homogeneity 
of  regression.  The  statistic,  which  simultaneously  tests  the  equality  of  re¬ 
gression  slopes  and  intercepts,  was  significant  in  11  out  of  28  instances.  A 
significant  ?2  statistic  (p  <  .05)  found  in  seven  Instances  indicated  that  the 
regression  slopes  were  not  equal.  These  seven  cases  included  the  five  instances 
in  which  a  significant  difference  had  been  found  between  the  white  and  Negro 
validity  coefficients.  The  four  significant  statistics  which  were  found  in¬ 
dicated  that  a  common  Intercept  was  inappropriate. 

Cultural  Deprivation 

Following  the  procedures  employed  in  the  previous  study,  the  sample  was 
divided  into  culturally  advantaged  and  culturally  deprived  groups  on  the  basis 
of  difference  scores  (the  difference  between  the  TMA-T  and  the  Pictorial 
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Clerical  Insurance  Workers 


Table  31:  Analysis  of  Covariance  for  Homogeneity  of  Regression 


Seasoning  Test  in  standard  score  units).  Ss  with  a  positive  difference  score 
(i.e.  those  who  performed  better  on  the  culture-bound  exam)  were  considered  cul¬ 
turally  advantaged,  while  Ss  with  a  negative  score  (i.e.  those  who  performed 
setter  on  the  culture-free  exam)  were  considered  culturally  deprived.  An  analysis 
ras  performed  for  positive  and  negative  difference- score  groups  within  the  white 
ind  Negro  groups  and  for  the  total  white  and  total  Negro  groups.  Meaningful  data 
:ould  not  be  reported  for  the  Negro  group  with  a  positive  difference  score  because 
the  sample  size  (W*5)  was  too  small. 

Table  32  presents  the  validity  coefficients  for  the  1MA-T  and  the  Pictorial 
Reasoning  Test  for  each  of  the  subgroups.  Neither  test  exhibits  substantial 
ralldlty:  the  TMA  is  significant  (p  (  .05)  in  only  three  out  of  56  cases,  and 
the  Pictorial  Reasoning  Test  did  not  reach  significance  in  a  single  instance. 

However,  for  the  clerical  groups,  the  correlations  of  both  the  IMA  and  the 
Pictorial  Reasoning  Test  tended  to  be  positive  for  groups  identified  as  culturally 
idvantaged,  and  negative  for  groups  identified  as  culturally  deprived.  When  the 
IMA  was  used  as  a  predictor,  19  out  of  21  predictor-criterion  correlations  were 
positive  for  culturally  advantaged  groups,  and  23  out  of  28  were  negative  for 
mlturally  deprived  groups.  When  the  Pictorial  Reasoning  Test  was  used  as  a  pre- 
Lictor,  17  out  of  21  correlations  were  positive  for  culturally  advantaged  groups, 
md  23  out  of  28  were  negative  for  culturally  deprived  groups. 

The  difference  score-criterion  correlations  are  presented  in  Table  32  . 
igain  there  was  virtually  no  relationship  between  the  predictor  and  the  criterion, 
toe  of  58  correlations  was  significant  (p  <  .05)  which  was  less  than  would  be 
sxpected  by  chance. 

Thus  it  appears  that  the  Pictorial  Reasoning  Test  offers  no  advantage  over 
he  TMA  when  used  with  culturally  deprived  individuals  regardless  whether  cultural 
eprivation  is  defined  on  the  basis  of  race  or  a  difference  score  between  a  culture- 
ound  and  a  culture-free  test.  Furthermore,  the  utility  of  a  difference  score  as 
predictor  appears  questionable. 


Table  32:  (Contd) 


Study  #6:  Prediction  of  9uccess  in 
Narine  Corps  Training 


Sample 

The  sample  consisted  of  193  Marine  Corps  recruits  undergoing  training  at 
Service  Support  Schools.  Eighty-eight  of  these  were  Negro  and  105  were  white. 

The  sample  size  vas  smaller  for  specific  analyses  due  to  incomplete  predictor 
or  criterion  information.  Ss  were  enrolled  in  training  courses  for  food  service 
personnel,  supply  school  personnel,  and  motor  transport  personnel.  Personal  data 
for  the  Ss  are  summarized  in  Table  33*  Community  rank  is  a  measure  of  the  size 
of  the  community  in  which  the  recruit  had  spent  most  of  his  life,  and  was  rated 
on  an  eight  point  continuum  ranging  from  "in  the  open  country  or  in  a  faming 
community "  at  the  low  end  to  "in  a  suburb  of  a  very  large  city  (over  500 ,000 
people)”  at  the  high  end.  The  difference  between  Negro  and  white  samples  was 
significant,  with  Negroes  tending  to  reside  in  larger  communities.  Patherts  and 
mother*  s  education  was  likewise  rated  on  an  eight  point  continuum  with  the  lowest 
point  representing  no  education  or  incomplete  grade  school  and  the  highest  point 
representing  completion  of  graduate  or  professional  school.  There  were  no  differ¬ 
ences  between  the  racial  samples  on  the  amount  of  parental  education.  Fathers 
occupation  was  ranked  on  a  seven  point  scale  suggested  by  Hollingshead  (19^9) 
ranging  from  unskilled  workers  to  executives,  proprietors  of  large  concerns,  or 
major  professionals.  Salary  was  ranked  on  a  five-point  scale  in  $2000  increments 
from  less  than  $2000  to  $10,000  or  more.  Whites  were  significantly  higher  on 
both  father's  occupation  and  salary  measures.  There  were  no  significant  mean 
differences  for  either  age,  education,  or  the  mmiber  of  weeks  the  recruits  had 
been  attending  Service  Support  Schools.  Mean  ages  were  slightly  less  than  20  years 
mean  education  was  slightly  greater  than  completion  of  the  eleventh  grade;  and  the 
mean  number  of  weeks  in  school  was  slightly  greater  than  three. 
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Table  33  :  Personal  Data 


Group 

X 

s.d. 

t<1 2> 

Cosnunlty 

Total 

3-44 

2.17 

187 

Rank 

White 

2.89 

2.05 

102 

Negro 

4.09 

2.14 

85 

3.89** 

Father’s 

Total 

■vn|| 

1.36 

147 

Education 

White 

1.38 

84 

Negro 

3.2k 

1*35 

63 

•  57 

Mother's 

Total 

3.59 

160 

Education 

White 

3.68 

90 

Negro 

3. M 

■SHI 

70 

.94 

Father's 

wmm 

2.80 

170 

Occupation 

3.04 

96 

2.49 

!■ 

74 

2.83** 

Salary 

Total 

3.41 

■H 

104 

White 

3-75 

57 

Negro 

3.00 

47 

3.31** 

Age 

Total 

19.78 

1.67 

179 

(years) 

White 

19-77 

1.77 

98 

Negro 

19.78 

1.56 

81 

.03 

Education 

Total 

1.61 

187 

(years) 

White 

I.83 

102 

Negro 

1.30 

85 

.65 

Weeks 

Total 

3.38 

1.89 

186 

in  School 

White 

3.40 

1.92 

101 

Negro 

3.36 

1.86 

85 

.14 

(1)  Total  N  I3  less  than  193  because  of  incomplete  data  from  some  subjects. 

(2)  t  ratio 8  are  between  the  means  of  the  white  and  Negro  samples. 

**  p  <  .01 
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Predictors 


Three  tests  were  used  as  predictors:  the  Area  Aptitude  Test;  the  Armed  Forces 
Qualification  Test  (AFQT)j  and  the  Fundamental  Achievement  Series  which  yielded 
verbal,  numerical,  and  total  scores  (FAS-V,  FAS-N,  and  FAS-T,  respectively).  The 
Fundamental  Achievement  Series  is  a  test  of  elementary  verbal  and  numerical  skills, 
and  was  regarded  as  less  culture  bound  than  either  the  .Area  Aptitude  Test  or  the 
AFQP.  The  means,  standard  deviations,  and  the  test  of  significance  of  mean  differ¬ 
ences  are  presented  in  Table  3^ •  The  means  for  the  white  sample  were  higher  than 
the  Negro  sample  for  all  five  predictors  (p<  .01). 

Criteria 

The  criterion  of  final  class  standing  was  obtained  by  convertii^  rank  in 
class  to  a  stanine  score.  Although  class  standi of  the  whites  appears  slightly 
higher  than  that  of  the  Negroes,  the  difference  was  not  significant.  The  criterion 
mean,  standard  deviation,  and  test  of  significance  for  mean  difference  are  pre¬ 
sented  in  Table  35* 

Validity 

Predictor-criterion  correlations  for  demographic  information,  test  scores 
and  AFQT-FAS  difference  scores  are  presented  in  Table  3d  for  the  combined  groups 
as  well  as  the  Negro  and  white  subgroups.  All  test  scores  show  significant  valid¬ 
ity  for  the  white  subgroup  and  no  significant  validity  for  the  Negro  subgroup. 

The  only  variables  showing  significant  criterion  correlations  for  the  Negro  group 
were  age  and  education. 

Models  Illustrated 

a)  The  model  which  occurred  most  frequently  was  Model  7*  It  appeared  for 
six  of  the  fourteen  predictors  examined,  including  all  of  the  predictor  tests 
except  the  Area  Aptitude  Test,  as  well  as  community  rank  and  father's  occupation. 


Table  3^:  Predictors  -  Means,  Standard  Deviations,  N's, 
and  Tests  of  Significance  of  Mean  Differences 


Predictor 

■ 

X 

s 

t(2) 

Area  Aptitude 

Total 

97.49 

13. 7fc 

192 

Test 

White 

100.67 

16.08 

105 

Negro 

93.67 

8.91 

87 

3.79** 

Armed  Forces 

39.85 

19.98 

189 

Qualification 

47.70 

21.09 

102 

Test 

30. 64 

13.83 

87 

6.63** 

Fundamental 

Achievement 

Series 


Verbal 

Total 

White 

Negro 

70.20 

80.67 

77.  44 

8.30 

7.78 

8.60 

187 

102 

85 

2.68** 

Numerical 

Total 

53-  31 2* 

7.39 

187 

White 

55-90 

7.13 

102 

Negro 

50.27 

6.51 

85 

5.56** 

Total 

Total 

132.58 

13.65 

187 

White 

136.57 

13.  M 

102 

Negro 

127.29 

12.  40 

85 

4.59** 

(1)  Total  N  is  less  than  193  because  of  Incomplete  data  for  some  subjects. 

(2)  t  ratios  are  between  the  means  of  the  white  and  Negro  samples. 

**  p  <  .01 
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Table  35:  Criterion  -  Mean,  Standard  Deviation, 
and  Test  of  Significance  for  Mean  Difference 


Criterion 

X 

s.d. 

N 

t(U 

Class  Standing  Total 

U.78 

1.95 

187 

(  Standing  Score  ) 

White 

5-08 

2.00 

102 

Negro 

k.kl 

1.83 

85 

.25 

(l)  t  ratio  is  between  the  mean  of  the  white  and  Negro  samples. 


Table  3&:  Predictor  -  Criterion  Correlation 
Total  Group,  Whites,  and  Negroes^' 1  2) 


Predictor 

HI 

Hi 

N 

Area 

Aptitude 

Total 

White 

Negro 

19** 

19 

07 

(n) 

186 

102 

84 

Armed  Forces 
Qualification 
Test 

Total 

White 

Negro 

35** 

47** 

05a 

(7) 

183 

99 

84 

nusxamehtal 

Achievement 

Series 

— Verbal 

Total 

White 

Negro 

23** 

30** 

09 

(7) 

181 

99 

82 

ftunerical 

Total 

White 

Negro 

23** 

23** 

12 

(7) 

181 

99 

82 

Total 

Total 

White 

Negro 

26** 

30** 

11 

(7) 

181 

99 

82 

Comsunity 

Rank 

Total 

White 

Negro 

-19** 

-20* 

-09 

(7) 

181 

99 

82 

Father' 8 
Education 

Total 

White 

Negro 

15 

11 

19 

144 

83 

61 

Predictor 

Group 

_ r _ 

F 

Mother's 

Education 

Total 

White 

Negro 

H 

156 

86 

68 

Father*  s 
Occupation 

Total 

White 

Negro 

164 

93 

71 

Salary 

Total 

White 

Negro 

09 

04 

-02 

101 

56 

45 

Age 

Total 

White 

Negro 

39** 

43** 

33** 

173 

95 

78 

Education 

Total 

White 

Negro 

33** 

4o** 

23* 

181 

99 

82 

Weeks  in 
school 

Total 

White 

Negro 

03 

07 

-04 

180 

98 

82 

Difference 

Score  (AFQT 
minus  FAS-T 
standard  score 

unit) 

Total 

White 

Negro 

_ 

12 

24* 

-11a 

(5) 

177 

97 

80 

(1)  Decimals  are  omitted. 

(2)  Number  in  parentheses  below  the  correlation  for  the  Negro  sample  in¬ 
dicates  the  Model  illustrated . 

a  Indicates  significant  difference  between  white  and  Negro  correlations. 

*  P  <-05 

**  p  <  .01 
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In  these  Instances,  significant  mean  differences  were  found  for  the  predictor, 
but  not  for  the  criterion.  In  each  instance,  the  predictor  vas  valid  for  the 
white  sample  but  not  for  the  Negro  sample.  Since  Negroes  scored  significantly 
lover  on  the  predictors  but  not  on  the  criterion  (the  sole  exception  being  that 
whites  came  from  smaller  communities  than  the  Negroes,  and  hence  had  a  lower 
predictor  mean  in  that  instance),  the  employment  of  these  predictors  as  selec¬ 
tion  instruments  would  be  unfair  to  the  Negroes.  The  most  striking  example  of 
Model  7  is  that  of  the  AFQT,  where  the  white  correlation  was  significantly 
different  from  the  Negro.  The  widespread  use  of  the  AFQP  despite  its  lack  of 
validity  for  the  Negro  sample  raises  serious  concerns. 

b)  Two  examples  of  Model  5  occurred  where  the  level  of  the  mother's  educa¬ 
tion  and  the  difference  score  were  significantly  (p  <  .05)  related  to  class  stand 
ing  for  the  whites,  but  not  for  the  Negroes.  No  mean  differences  between  racial 
groups  were  observed  for  either  the  predictors  or  the  criterion.  Lack  of  valid¬ 
ity  for  the  Negro  sample  makes  these  variables  Inappropriate  as  predictors  for 
the  total  group. 

The  analysis  of  covariance  for  homogeneity  of  regression,  the  results  of 
which  are  presented  in  Table  37  shows  a  significant  Pj  statistic  in  six  instances: 
prediction  by  the  FAS-V,  mother's  education,  salary,  age,  education,  and  the  dif¬ 
ference  score.  The  F^  statistic  simultaneously  tested  the  hypothesis  that  both 
the  regression  shapes  and  the  intercepts  were  equal  for  the  two  groups.  The  F£ 
statistic  indicated  the  appropriateness  of  separate  regression  slopes  vere  slgnlfl 
cant  for  the  AFQT  and  the  difference  score.  The  F^  statistic  Indicated  the 
necessity  for  separate  intercepts  for  the  two  racial  groups  when  using  either 
mother's  education,  salary,  weeks  in  school,  age,  or  education  as  predictors. 

Cultural  Deprivation 

Since  Guion  (19&6)  has  suggested  that  the  difference,  in  standard  score  units 
between  a  culture-bound  test  and  a  culture-free  test  may  be  an  indicator  of  cul¬ 
tural  deprivation,  such  a  measure  may  be  used  as  a  predictor  itself,  or  it  may 
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Table  37:  Analysis  of  Covariance  for  Homogeneity  of  Regression 


U) 


Predictor 

Fx(2) 

f2(3) 

F^> 

Area  Aptitude 

1.92 

3.05 

.77 

(2,181) 

(1,181) 

(1,182) 

Fundamental  Ach.  Series 

3.26* 

2.95 

3.53 

Verbal 

(2,177) 

(1,177) 

(1,178) 

Numerical 

1.09 

•  52 

I.67 

(2,177) 

(1,177) 

(1,178) 

Total 

1.72 

1.60 

1.85 

(2,177) 

(1,177) 

(1,178) 

AFQT 

2.65 

5.29* 

.00 

(2,179) 

(1,179) 

(1,180) 

Conmunity  Rank 

1.84 

.79 

2.89 

(2,177) 

(1,177) 

(1,178) 

Father' 8  Education 

1.90 

.22 

3.60 

(2,140) 

(1,140) 

(1,141) 

Mother* 8  Education 

4.44* 

3.72 

5.07* 

(2,152) 

(1,152) 

(1,153) 

Father's  Occupation 

1.50 

.70 

2,29 

(2,160) 

(1,160) 

(1,161) 

Salary 

3.38* 

•  09 

6.73* 

(2,197) 

(1,197) 

(1,198) 

Weeks  in  School 

2.82 

.46 

5-19* 

(2,176) 

(1,176) 

(1,177) 

Age 

3.16* 

.26 

6.09* 

(2,169) 

(1,169) 

(1,170) 

Education 

3.68* 

•  30 

7.09** 

(2,177) 

(1,177) 

(1,178) 

AFQT  minus  FAS-T 

4.33* 

5.41* 

3.1 8 

(2,172) 

(1,172) 

(1,173) 

(1)  Degrees  of  freedom  for  each  ratio  are  shown  in  parentheses  below  the 
statistic. 

(2)  F^  tests  hypothesis  that  E(  |  Xj. )  =  a  bX-n  for  a11  1  g»*o«P** 

(3)  ?2  tests  hypothesis  that  E(Yj,|  Xj^J  ■  aj  ♦  bx^j  for  all  i  groups. 

(4)  Fo  tests  hypothesis  that  E(Yjj|  Xj«)  ■  a^  ♦  bjXij  for  all  i  groups. 

*  p  < .05 

**  p  ^  .01 
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be  used,  as  a  moderator  variable  to  divide  culturally  advantaged  and  culturally 
deprived  groups:  Both  of  these  suggestions  were  employed  in  this  study  by  using 
the  difference  between  the  A FQT  and  the  FAS-T,  in  standard  score  units.  Those 
with  a  positive  difference  score  were  considered  culturally  advantaged;  those  with 
a  negative  difference  score  were  considered  culturally  deprived.  Positive  and 
negative  difference  score  groups  were  also  identified  within  the  white  and  Negro 
groups. 

Validity  coefficients  for  each  of  the  predictors  as  well  as  for  the  differ¬ 
ence-score  are  presented  in  Table  38.  The  difference-score  only  correlated  with 
che  criterion  (p  <  .05)  for  the  white  group.  Since  the  magnitude  of  its  corre¬ 
lation  was  far  below  that  of  age,  education,  and  the  A FQJP,  its  usefulness  as  a 
predictor  is  questionable. 

As  a  moderator  variable,  the  difference  score  shows  relationships  similar 
to  those  found  for  the  race  variable.  Thirteen  of  the  fourteen  predictors  pre¬ 
sented  in  Table  38  had  validity  coefficients  of  a  larger  absolute  magnitude  in 
the  white  than  in  the  Negro  samples.  Likewise,  11  of  l4  coefficients  were  larger 
in  the  total  group  with  positive  difference  scores  than  in  the  total  group  with 
negative  difference  scores.  Although  the  same  trend  was  observed  in  the  Negro 
groups  (10  of  14  correlations  being  greater  in  the  positive  difference  score 
group),  validities  for  the  white  group  with  negative  difference  scores  were  not 
consistently  lower  than  those  for  the  white  group  with  positive  difference  scores. 

Age  and  education  provided  the  most  consistently  high  correlation  with  the 
criterion  across  cultural  groups.  Both  were  significantly  correlated  (p  <  .05) 
for  all  but  one  of  the  subgroups.  The  FAS  and  the  AFQT  had  a  tendency  to  be 
correlated  with  final  class  standing  only  for  those  subsamples  where  white  subjects 
are  Included.  Both  the  FAS-T  and  the  AFQT  were  significantly  (p  <  .05)  related 
to  the  criterion  in  every  subsample  except  tne  total  Negro  sample  and  the  Negro 
samples  divided  according  to  difference  score. 
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Table  38:  Correlations  with  Final  Class  Standing  for  Culturally  Advantaged 


Table  36:  (Contd) 


with 


Thus,  it  appears  that  cultural  deprivation,  as  measured  by  a  difference 
score  (in  standard  score  units)  between  a  culture -bound  test  and  a  culture -free 
test,  does  not  have  utility  as  a  predictor.  As  a  moderator,  it  offers  little 
beyond  the  more  obvious  moderator  of  race.  An  exception  to  the  above  statement, 
intimated  by  the  data,  may  be  a  tendency  to  achieve  greater  predictability  in 
white  groups  moderated  on  the  basis  of  the  difference  score. 

The  difference  score  was  tested  further  as  a  moderator  according  to  the 
method  suggested  by  Saunders  (1956).  The  total  sample  was  divided  into  back  and 
cross  samples  (N=*85  and  N=8U  respectively).  Cross-validated  multiple  R’s  using 
each  of  the  four  most  promising  predictors  (FAS-T,  AFQT,  Age,  Education)  individ¬ 
ually  with  the  difference  score  and  an  interaction  term  were  never  higher  than 
the  zero-order  correlation  coefficient  between  the  predictor  test  itself  and  the 
criterion.  Thus,  it  appears  that  when  this  procedure  is  used  the  utility  of  a 
difference  score  moderator  is  also  limited.  The  cross-validated  multiple  R  com¬ 
bining  the  FAS-T,  AFQT,  Age,  and  Education  was  .U5,  well  above  the  cross-validated 
statistics  using  the  difference  score  as  a  moderator. 

The  degree  of  shrinkage  when  all  predictors  were  combined  precluded  the 
utility  of  a  multiple  regression  equation:  All  multiple  R's  and  cross-validated 
statistics  are  presented  in  Table  39* 


Table  ,39:  Multiple  Regression  Analyses 


(l)  Sample  sizes  were  39  and  38  for  back  and  cross-sample  respectively 
when  using  all  predictors,  85  and  84  for  back  and  cross-samples 
respectively  for  all  other  analyses. 


Study  #7:  Work  Saaple  and  Psychomotor  Tests  as  Predictors  of 
the  Performance  of  Sewing  Machine  Operators 


Sample 

The  subjects  in  this  investigation  were  160  female  sewing  machine  operators 
employed  by  a  large  garment  manufacturing  company.  The  sample  included  67  white 
and  93  Negro  employees.  Table  ^0  presents  biographical  data  on  the  sample. 

The  white  and  Negro  subgroups  did  not  differ  in  either  age  or  educational  level. 

Procedure 

A  major  purpose  of  the  present  study  was  to  evaluate  a  work  sample  test  in 
terms  of  the  information  that  the  test  provided  concerning  the  nature  of  the  Job. 
It  was  hypothesized  that  a  work  sample  test  would  be  useful  as  a;i  aid  to  appli¬ 
cant  self-selection  for  the  Job  by  providing  the  applicants  with  a  realistic  pic¬ 
ture  of  the  work  envirorment.  Thus,  the  work  sample  test  provided  a  preview 
of  the  Job  that  could  help  an  applicant  determine  if  the  work  seemed  suited  to 
her  Interests  and  abilities. 

In  order  to  test  the  hypothesis  that  the  work  sample  test  would  be  an 
effective  source  of  Information  about  the  Job,  applicants  were  randomly  assigned 
to  one  of  three  experimental  groups.  Group  A  did  not  take  any  tests  prior  to 
employment.  Group  B  were  administered  two  perceptual  tests  (a  pinboard  and  a 
formboard).  The  perceptual  tests  were  given  in  order  to  ascertain  if  testing, 
per  ae,  had  any  Influence  upon  decisions  to  take  a  Job.  Group  C  were  adminis¬ 
tered  the  two  perceptual  tests  and  a  work  sample  test,  called  the  Career  Determin¬ 
ing  Exercises.  The  Career  Determining  Exercises  took  about  two  hours  to  complete 
and  was  composed  of  items  which  required  the  applicant  to  handle  pieces  of  fabric, 
to  thread  the  machine,  and  to  actually  operate  the  sewing  machine.  The  scoring 
of  the  work  sample  test  was  based  upon  time  required  to  complete  the  exercises 
and  where  appropriate  quality  of  the  performance,  e.g.,  the  actual  sewing  re¬ 
quired  for  some  parts  of  the  test. 
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23.69 


7.53 
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In  addition  to  a  total  score,  three  subscores  for  the  Career  Determinirg 
Exercises  were  obtained.  A  Sewing  Dexterity  subscore  was  caaputed  for  the  iteas 
which  were  concerned  with  the  applicants*  ability  to  handle  pieces  of  fabric. 

An  Equipment  Aptitude  subscore  was  obtained  for  those  iteas  concerned  with  the 
applicants  ability  to  thread  and  maintain  the  machine.  The  Machine  Control 
subscore  was  computed  for  the  items  which  required  the  applicant  to  actually 
sew  certain  standardized  patterns. 

Forty  subjects  were  assigned  to  Groups  A  and  B  and  80  subjects  to  Group  C. 

All  applicants  for  the  Job  were  offered  employment,  regardless  of  their  scores 
on  the  predictors.  Since  the  Ss  were  assigned  to  the  groups  on  a  random  basis, 
differing  proportions  of  white  and  Negro  applicants  were  assigned  to  the  three 
groups.  Group  A  was  composed  of  16  white  and  2k  Negro  workers  and  Group  C  con¬ 
tained  36  white  and  44  Negro  applicants. 

In  order  to  assess  the  effectiveness  of  the  Career  Determining  Exercises  as 
a  self -select ion  aid,  Groups  A,  B,  and  C  were  compared  with  respect  to  voluntary 
turnover  within  the  first  six  weeks  of  employment.  It  was  hypothesized  that 
Group  C,  which  had  been  administered  the  Career  Determining  Exercises,  would  have 
a  higher  percentage  of  applicants  who  would  refuse  employment  and,  consequently, 
have  a  lower  voluntary  turnover  rate  than  Groups  A  or  B. 

The  turnover  criterion  was  developed  by  classifying  all  employees  into  one 
of  six  categories.  The  categories  were:  Remaining  on  the  Job;  Refused  Employment; 
Terminated  Due  to  Lack  of  Progress;  Terminated  Due  to  Absenteeism;  Voluntary  Quit; 
and  Involuntary  Quit.  The  Involuntary  Quit  category  included  those  workers  who 
had  to  quit  their  Jobs  for  such  reasons  as  moving  from  the  area,  sickness  in  the 
family,  etc.  Voluntary  Turnover  was  defined  as  the  sta  of  the  Voluntary  Stitt  and 
Termination  Due  to  Absenteeism  categories.  The  Termination  Due  to  Absenteeism 
category  was  Included  in  Voluntary  Turnover  because  the  workers  in  this  category 
had  actually  withdrawn  from  the  organization.  In  most  of  these  cases  the  workers 
were  formally  terminated  by  the  company  only  after  it  was  apparent  that  the  workers 
did  not  intend  to  return  to  the  Job. 
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The  validities  of  the  Career  Determining  Exercises  and  the  tvo  perceptual 
tests  (the  pinboard  and  formboartl )  were  also  examined.  Criteria  Included  turn¬ 
over  within  six  weeks  of  employment  and  progress  in  training.  Two  measures  of 
training  progress  were  used,  single  cycle  time  and  production  rate.  Single 
cycle  time  measured  the  amount  of  time  required  for  the  sewing  machine  operator 
to  complete  one  unit  of  the  task  that  she  was  learning.  Production  rate  re¬ 
ferred  to  the  number  of  units  completed  ner  hour.  Since  the  operators  were  being 
trained  for  a  mxnber  of  different  sewing  operations,  both  single  cycle  time  and 
production  rate  were  measured  on  a  dichotomous  scale.  That  is,  the  operator  wis 
scored  as  being  above  or  below  the  expected  rate  for  her  operation.  Each  speci¬ 
fic  operation  had  an  expected  single  cycle  time  and  production  rate  which  had 
been  previously  determined  by  time  and  motion  analysis. 

Progress  in  training  and  the  turnover  criterion  were  both  measured  at  three 
intervals:  two,  four,  and  six  weeks  after  employment. 

Predictor  Comparisons 

Table  Ui  presents  the  means,  standard  deviations,  and  tests  of  the  signific¬ 
ance  of  the  difference  between  subgroup  means  for  the  predictor  variables  used 
with  the  sewing  machine  operator  sample.  The  white  subgroup  scored  significantly 
higher  than  the  Negro  subgroup  on  all  predictors  except  the  Pinboard  .Perceptual 
Test. 

Criterion  Comparisons 

There  were  no  significant  differences  between  the  white  and  Negro  subgroups 
with  respect  to  either  of  the  training  criteria  measured  at  any  of  the  time  inter 
vals.  With  respect  to  the  turnover  criteria,  the  only  significant  difference  be¬ 
tween  the  white  and  Negro  subgroups  was  the  percentage  ranainlng  on  the  Job  at 
all  three  time  intervals  for  Group  C. 
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Table  41 :  Predictors  -  Means,  Standard  Deviations,  It's,  and  Tests 


of  Significance  of  Mean  Differences  -  Seeing  Machine  Operators 


Predictor 

Group 

mm 

s 

n 

t 

Pinboard 

(number 

correct) 

Total 

White 

Negro 

EH 

m 

8.47 

8.60 

8.23 

120 

51 

69 

0.77 

Fomboard 

Total 

69.33 

29.75 

120 

(in  seconds) 

White 

57.98 

16.15 

51 

Negro 

79.69 

35-27 

69 

4.48** 

Career 

Total 

78.00 

15.98 

80 

Determining 

White 

85.90 

9.14 

36 

Exercises  - 
Total 

Negro 

70.89 

17.29 

44 

4.86** 

C.DeEe  • 

Total 

38.97 

9.79 

80 

Sewing 

White 

te.53 

5.25 

36 

Dexterity 

Negro 

35.70 

11.37 

44 

3-55** 

C  eDe  B  •  * 

Total 

8.23 

3-71 

80 

Equipt. 

White 

9.5^ 

2.48 

36 

Aptitude 

Negro 

7.01 

4.17 

44 

3.36** 

CsDtE*  • 

Total 

30.04 

7.59 

80 

Machine 

White 

33.36 

6.12 

36 

Control 

Negro 

26.98 

I 

8.03 

44 

3.97** 

1 


Evaluation  of  Career  Determining  Exercises  as  an  Aid  To  3e If- selection 

Tables  42 ,  43,  and  44  present  the  percentage  data  for  the  various  turnover 

categories  measured  at  2,  4,  and  6  week  intervals,  respectively.  Tables  45, 

46,  and  47  present  the  same  data  for  the  combined  category  of  Voluntary  Turnover 
and  the  category  of  Remain  on  Job.  The  only  significant  difference  between  the 
experimental  groups  occurred  for  the  white  subgroup.  The  white  subgroup  in 
Group  C  (which  had  been  administered  the  Career  Determining  Exercises)  had  a 
significantly  lower  voluntary  turnover  rate  at  the  six  week  Interval  than  the 
white  subgroup  in  Group  B.  The  data  generally  revealed  that  the  administration 
of  the  Career  Determining  Exercises  appears  to  be  effective  as  an  aid  to  self- 
selection.  There  was  only  a  single  significant  difference  between  Group  C  and 
either  Group  A  or  B.  However,  in  all  Instances,  the  Group  C  voluntary  turnover 
rate  for  the  white  group  was  lower  than  the  white  employees  of  Groups  A  and  B. 

No  such  consistent  pattern  was  found  for  the  Negro  subgroup. 

The  reasons  for  the  differential  effect  of  the  Career  Determining  Exercises 
administration  upon  the  voluntary  turnover  rates  of  whites  and  Negroes  are  not 
clear.  A  possible  explanation  is  that  the  white  and  Negro  subgroups  viewed  dif¬ 
ferent  aspects  of  the  work  situation  as  the  important  determiners  of  work  atti¬ 
tudes  and  behaviors.  It  is  hoped  that  future  research  concerned  with  the  deter¬ 
minants  of  work  motivation  for  these  workers  will  help  to  clarify  these  data. 

Validity 

Correlations  of  the  various  predictors  with  the  training  criteria  are  pre¬ 
sented  in  Table  48  and  with  the  turnover  criteria  in  Table  ^9.  Generally,  low 
validities  were  obtained.  In  three  instances  training  criteria  were  predictable 
for  the  Negro  subgroup  and  in  seven  cases  the  turnover  criteria  were  predictable 
for  the  white  subgroup.  The  perceptual  tests  were  valid  in  more  instances  than 
the  work  sample  test  or  subtests,  but  the  nunber  of  significant  validity  coef¬ 
ficients  was  low  for  all  tests. 
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Table  42:  Tvo  Meek  Turnover  -  Percentage  Data  for  Voluntary 
Turnover  and  Category  of  Remain  on  Job 


Turnover  Category 

Group 

Voluntary  Turnover 

Remain  on  Job 

Total 

12.5% 

60.0% 

A  White 

12.5% 

75-0 % 

Negro 

12.5% 

50.0% 

Total 

10.0% 

75-0% 

B  White 

13-3% 

80.0% 

Negro 

8.0% 

72.0% 

Total 

6.2% 

63.8% 

C  White 

2.8% 

77.8%a 

Negro 

9-1% 

52.3%a 

®  Proportions  significantly  different  at  the  .05  level 


Table  ^3 :  Four  Week  Turnover  -  Percentage  Data  for  Voluntary 
Turnover  and  Category  of  Resale  on  Job 


a  Proportions  significantly  different  at  the  .05  level 


Table  44 :  Six  Week  Turnover  *  Percentage  Data  for  Voluntary 
Turnover  and  Category  of  Remain  on  Job 


Group 

Turnover  Catei 

gory 

Voluntary  Turnover 

Remain  on  Job 

Total 

25.0% 

37.5% 

A  White 

31.2% 

*3.8% 

Negro 

20.9% 

33-3% 

Total 

25-0% 

52-5% 

B  White 

40.0%G 

*6.7  % 

Negro 

16.0% 

56.0% 

Total 

16.3% 

52.5% 

C  White 

11.2%^ 

66.7%b 

Negro 

20.5% 

40.9%b 

a  Proportions  significantly  different  at  the  .05  level. 
b  Proportions  significantly  different  at  the  .05  level. 
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Table  45 :  Two  Week  Turnover  -  Percentage  Data 


Table  46:  Pour  Week  Turnover  -  Percentage  Data 


Table  47:  Six  Week  Turnover  -  Percentage  Data 


Table  V8 ;  Correlations  of  Predictors  with  Criterion  of  Turnover  - 

Sewing  Machine  Operator sa 


Turnover  Category 


Voluntary  Turnover 

Termination  - 
Progress 

Re»a 

dn  on  Job 

Predictor 

Group 

Two 

Week 

Four 

Week 

I 

( 

Six 

Week 

Two 

Week 

Four 

Week 

Six 

Week 

Two 

Week 

Four 

Week 

Six 

Week 

Pinboard*1 

Total 

24* 

31** 

22* 

03 

1 

04 

00 

15 

17 

11 

White 

52** 

59** 

43** 

13 

17 

05 

33* 

38* 

22 

Negro 

09s 

(5) 

12a 

(5) 

14 

(5) 

-08 

-05 

04 

01 

(6) 

02 

(6) 

06 

Foraboardb 

Total 

08 

01 

00 

15 

02 

00 

14 

02 

00 

White 

-28 

-30* 

-35* 

11 

08 

03 

-03 

-07 

-14 

Negro 

20 

na 

(7) 

o6a 

)7) 

23 

-02 

-03 

24 

03 

02 

C.D.E.c 

Total 

11 

14 

10 

06 

10 

-08 

-03 

-14 

-10 

Total 

White 

18 

18 

18 

18 

13 

11 

23 

18 

16 

Negro 

-09 

-07 

06 

01 

-10 

-07 

-06 

-10 

-02 

C.D.E.c 

Total 

05 

02 

07 

06 

-05 

-02 

07 

05 

02 

Sewing 

White 

15 

16 

16 

25 

27 

24 

28 

30 

27 

Dexterity 

Negro 

10 

04 

20 

-01 

-10 

-06 

07 

-05 

-06 

C.D.E.c 

Total 

03 

07 

04 

04 

-06 

-03 

05 

-01 

00 

Equipt . 

White 

21 

22 

23 

18 

26 

27 

24 

31 

31 

Aptitude 

Negro 

-32 

-18 

-12 

11 

10 

11 

-16 

-02 

00 

C.D.E.c 

Total 

27* 

24 

24 

07 

-06 

-05 

-12 

-15 

-15 

Machine 

White 

11 

10 

10 

1  04 

-11 

10 

08 

-06 

-07 

Control 

Negro 

07 

15 

15 

1 

-03 

-13 

-12 

03 

-02 

01 

a  Correlations  are  point  biserial.  Deciaals  are  oaltted. 
b  White  N  -  44;  Negro  H  =  54. 
c  White  N  *  30;  Negro  If  =  33  - 

*  P  <  .05. 

**  p  <  .01 

a  Correlations  of  subgroups  significantly  different  at  .05  level. 
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Table  49:  Correlations  of  Predictors  with  Criterion  of  Training 


Progress  -  Sewing  Machine  Operators® 


- 

Training  Progress 

|  Two  Week 

Pour  Week  I 

Six  Week 

Predictor 

Group 

Single 

Cycle 

Prod. 

Single 

Cycle 

Prod. 

Single 

Cycle 

Prod. 

Pinboard6 

Total 

-10 

00 

-08 

Ok 

13 

17 

White 

-18 

-03 

-16 

-08 

-11 

lB 

Ifegro 

-02 

04 

00 

-01 

53a 

(5) 

19 

Formbo&rd* 

Total 

-32** 

-17 

16 

03 

-03 

-14 

White 

-21 

-04 

06 

15 

00 

-18 

Negro 

-37* 

(7) 

-14 

02 

04 

-18 

-01 

C.D.E.c 

Total 

03 

21 

07 

23 

09 

18 

Total 

White 

30 

28 

-11 

03 

-02 

19 

Ifegro 

-23 

08 

29 

30 

09 

11 

C.D.E.c 

Total 

17 

12 

22 

20 

02 

30 

Sewing 

White 

15 

31 

01 

02 

-34 

31 

Dexterity 

Negro 

22 

-03 

48 

30 

44 

34 

C.D.E.c 

Total 

28 

31* 

12 

15 

-09 

-02 

Equipt . 

White 

10 

26 

11 

14 

06 

3^ 

Aptitude 

Negro 

43* 

(7) 

30 

20 

-04 

-33 

-51 

C.D.E.c 

Total 

-03 

13 

06 

17 

-17 

15 

Machine 

White 

-16 

22 

-21 

-07 

-22 

27 

Control 

Negro 

12 

02 

42 

30 

-08 

-03 

a  Correlations  are  point  biserial.  Decimals  are  omitted. 
b  Por  two  week  data,  White  N  =  31,  Negro  N  **  38;  for  four  week  data, 

White  If  =  26,  Ifegro  N  3 2 1  for  six  week  data.  White  N  *  20,  Negro  H  *  21 

c  Por  two  week  data,  White  N  =  25,  Ifegro  If  -  21;  for  four  week  data. 

White  If  =  22,  Ifegro  If  =  16;  for  six  week  data.  White  If  ■  19,  Negro  N  *  10 

*  p  <  .05. 

.01. 

a  Correlations  for  white  and  Ifegro  subgroups  significantly  different 
at  the  .05  level. 
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Models  Illustrated 


All  of  the  Bartlett -O'Leary  (1969)  models  found  with  the  sewing  machine 
operator  sample  vere  differential  validity  models.  There  were  no  cases  in  which 
a  predictor  was  valid  for  both  the  white  and  Negro  subgroups.  Four  instances 
each  of  Models  5  and  7  were  found.  Model  3  illustrates  the  situation  in  which 
no  significant  differences  are  found  between  subgroups  on  either  the  predictor 
or  criterion  measures  and  the  predictor  is  valid  for  only  one  subgroup.  The  pin¬ 
board  perceptual  test  was  valid  for  predicting  voluntary  turnover  for  the  white 
subgroup  at  all  three  time  intervals,  illustrating  Model  3*  The  pinboard  test 
was  valid  for  the  Negro  subgroup  in  predicting  the  single  cycle  measure  of  train¬ 
ing  progress  at  the  six  week  interval. 

Model  7  illustrates  a  situation  in  which  there  is  a  significant  difference 
between  subgroups  for  only  the  predictor  variable  and  the  predictor  is  valid  for 
only  one  subgroup.  The  relationships  between  the  formboard  perceptual  test  and 
various  criteria  constituted  three  of  the  examples  of  Model  7  found  with  the 
sewing  machine  operator  sample.  The  formboard  test  was  valid  for  the  white  sub¬ 
group  when  the  criterion  was  voluntary  turnover  at  four  or  six  weeks.  The  form- 
board  was  valid  for  the  Negro  subgroup  with  a  criterion  of  the  single  cycle 
measure  of  training  progress  at  the  two  week  interval.  The  Equipment  Aptitude 
subscore  of  the  Career  Determining  Exercises  was  valid  for  the  Negro  subgroup 
for  the  criterion  of  single  cycle  time  measured  at  the  two  week  interval. 

Model  6  was  found  for  two  predictor-criterion  relationship.''  with  the  sewing 
machine  operator  sample.  Model  6  illustrates  the  situation  in  which  the  subgroups 
differ  significantly  on  only  the  criterion  measure  and  the  predictors  valid  for 
only  one  subgroup.  The  pinboard  test  was  valid  for  the  white  subgroup  in  the  pre¬ 
diction  of  the  turnover  category  of  Remain  on  Job  at  the  two  and  four  week  intervals. 

The  imposition  of  the  additional  criterion  for  model  identification  of  a  sig¬ 
nificant  difference  between  subgroup  correlations  reduced  the  nvmber  of  models 
found  to  three  cases  of  Model  3  and  two  of  Model  7*  The  Pinboard  and  formboard 
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perceptual  tests  were  valid  for  the  white  subgroup  only  in  two  instances 
each  for  the  voluntary  turnover  category  in  which  significant  subgroup  dif¬ 
ferences  in  correlations  were  found.  The  only  instance  in  which  significant 
subgroup  correlation  differences  were  found  and  the  predictor  was  valid  for 
the  Negro  subgroup  occurred  with  the  relationship  of  the  Pinboard  Test  and 
training  progress  at  six  weeks  as  neasured  by  single  cycle  time. 
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Study  #6:  Learning  Measures  as  Predictors 
of  Task  Performance  in  Two  Ethnic  Subgroups 

Introduction 

During  the  continuing  controversy  about  the  utility  of  psychological 
tests,  some  researchers  have  seriously  questioned  whether  psychological 
tests  can  be  validly  employed  to  predict  minority  group  performance.  In 
their  review  of  the  learning  patterns  of  the  disadvantaged  3todolsky  and 
Lesser  (1967)  indicate  that  the  relationship  between  tested  intelligence 
and  performance  on  laboratory  learning  tasks  is  high  for  upper  socioeconomic 
groups  but  negligible  for  lover  socioeconomic  groups.  Moreover,  since  upper 
and  lover  socioeconomic  children  demonstrate  similar  performance  on  tasks  which 
do  not  require  transfer  from  previous  learning,  the  learning  ability  of  children 
from  lover  socioeconomic  backgrounds  may  not  be  adequately  reflected  in  general 
intelligence  tests.  Several  investigators  (Kirkpatrick,  Even,  Barrett,  and 
Katzell,  1968;  Mitchell,  Albright,  and  McMirray,  1968;  Ruda  and  Albright, 

1968)  have  noted  that  tests  may  not  have  the  same  degree  of  validity  for  min¬ 
ority  group  members  as  they  do  for  white  subgroups.  Also,  in  the  first  phase 
of  this  contract  (O'Leary,  Parr,  and  Bartlett,  1970)  the  investigators  freq¬ 
uently  found  that  tests  were  valid  only  for  the  white  sample  (e.g..  Model  5, 
Bartlett  and  O'Leary,  1969). 

In  view  of  these  data,  both  educators  and  employees  have  suggested  that 
there  is  a  need  for  the  development  of  new  testing  techniques  or  substitutes 
for  tests  (Sheppard  and  Strlner,  1968).  The  present  study  attempted  to  combine 
the  techniques  of  differential  psychology  and  the  learning  laboratory  to  obtain 
valid  predictors  for  minority  group  members. 


Rationale 


Jensen  (1968)  has  Inferred  "basic  learning  abilities"  froa  an  individ¬ 
ual's  performance  on  learning  tasks  which  are  relatively  free  froa  media- 
tional  processes  or  specific  transfer  froa  previous  learning.  If  an  individ¬ 
ual  has  good  basic  learning  ability  and  is  given  the  appropriate  enrlronaental 
input,  Jensen  states  that  he  shaald  be  able  to  acquire  the  learning  sets,  media  - 
tlonal  habits,  verbal  associative  networks,  and  the  reservoir  of  transferable 
skills  that  largely  constitute  educability  (1968).  The  learning  tasks  utilised 
by  Jensen  to  measure  these  "basic  learning  abilities"  include  selective  trial- 
and-error  learning,  free  recall,  serial  and  paired-associate  learning.  The 
experimental  conditions  were  designed  to  minimize  the  effects  of  prior  learning 
Ferguson's  theory  (195k,  1956)  on  learning  and  human  ability  postulated  a 
similar  point  of  view.  According  to  Ferguson,  "the  abilities  of  man,  includ¬ 
ing  the  reasoning,  maiber,  perceptual,  and  spatial  abilities,  and  whatever 
is  subsumed  under  Intelligence,  are  attributes  of  behavior,  which  through 
learning  have  attained  a  crude  stability  or  invariance  in  the  adult..."  (p.121) 
That  is,  abilities  are  defined  in  terms  of  performance  on  psychological  tests 
and  the  asymptotic  performance,  measured  by  these  tests,  is  considered  to  be 
a  crude  limit  of  learning.  Ferguson  conceptualizes  abilities  as  over learned 
acquisitions  -  their  stability  is  the  result  of  overlearning. 

Ferguson  suggests  that  what  is  learned  and  the  age  at  which  it  is  learned 
is  prescribed  by  cultural  factors.  Thus,  different  cultural  enviroments  lead 
to  the  development  of  different  patterns  of  ability.  The  results  of  Lesser, 
Flfer,  and  Clark  (1965)  support  this  point.  Theycoupared  middle -and -lower 
class  children  of  four  ethnic  groups:  Chinese,  Jews,  Negroes,  and  Puerto 
Ricans.  The  abilities  measured  were  Verbal,  Reasoning,  Numerical  and  Spatial. 
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While  an  overall  social  class  difference  in  mean  performance  was  observed, 
the  most  important  finding  was  that  the  pattern  of  abilities  vas  different 
for  each  ethnic  group.  This  pattern  vas  invariant  across  social  classes. 

Rapier  (1962)  has  also  supported  the  view  that  intelligence  tests  meas¬ 
ure  acquired  past  knowledge  or  require  the  use  of  past  knowledge  in  new 
situations.  The  assumption  is  that  an  individual's  past  learning  is  a  valid 
predictor  of  what  he  will  be  able  to  learn  in  the  future.  McGeoch  and  Irion 
(1952)  state  that  the  predictive  value  of  past  learning  rests  upon  two  assump¬ 
tions:  (l)  that  all  of  the  individuals  tested  have  had  an  equal  opportunity 
in  everyday  living  or  in  school  to  learn  the  materials  the  test  uses,  and 
(2)  although  all  individuals  have  had  an  equal  opportunity  to  learn  the 
materials  of  the  test,  there  has  been  some  differential  learning.  As  a  re¬ 
sult,  the  differences  which  appear  in  a  test  score  are  a  reflection  of  differ¬ 
ences  in  Intellectual  ability. 

An  Implication  of  these  views  is  that  differential  reinforcement  of  basic 
ability  patterns  could  result  in  various  minority  groups  being  at  different 
points  on  the  learning  curve.  Thus,  if  the  Negro' s  cultural  background  rein¬ 
forced  a  pattern  of  abilities  which  differed  from  that  of  the  white  subgroup, 
he  would  not  be  at  the  same  point  on  the  learning  curve  as  his  "equally  capable" 
white  counterpart .  The  frequently  observed  differences  in  mean  test  perform¬ 
ance  for  white  and  Negro  individuals  (Anastasl,  1966;  Krug,  1966)  nay  be  a 
function  of  the  differential  pattern  of  reinforcement  of  abilities  in  the  tvo 
cultural  groups.  Moreover,  ability  tests  measure  the  amount  an  individual 
has  learned  under  almost  Infinite  variations  in  the  conditions  of  learning. 
Current  psychological  tests  may  be  poor  predictors  of  performance  for  minority 
group  members  because  these  variations  in  the  conditions  of  learning  have  not 


been  taken  into  account. 
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It  is  possible  that  an  Ss  performance  in  an  actual  learning  situation  does 
not  depend  primarily,  as  it  does  vith  traditional  ability  tests,  upon  what  he 
has  learned  before  he  cooes  to  the  learning  situation.  The  individual  is 
simply  given  something  to  learn  and  the  amount  he  learns  is  used  as  a  measure 
of  his  ability.  Standardization  of  the  learning  situation  makes  the  condi¬ 
tions  of  learning  more  nearly  equal  for  both  majority  and  minority  group  members 
than  the  learning  required  for  aptitude  test  performance.  Moreover,  research 
studies  of  minority  group  performance  on  simple  learning  tasks  suggest  that 
mean  differences  in  performance  for  white  and  Negro  individuals  in  standard¬ 
ized  learning  situations  would  be  minimal  as  compared  to  mean  differences  in 
ability  test  performance  for  the  two  ethnic  groups. 

Some  researchers  have  employed  measures  of  performance  in  learning  situa¬ 
tions  as  predictors.  Frederiksen,  Carstater,  and  Stait  (19^7 )  suggested  that 
scores  derived  from  miniature  learning  situations  might  be  used  to  estimate 
an  individual's  performance  in  future  learning  situations,  i.e.,  final  course 
grades  might  be  predictable  from  how  well  the  student  learned  similar  concepts 
and  principles  taught  in  an  hour's  period  of  time  before  the  course  began. 
Allison's  ( 195^,  1956)  studies  at  U.S.  Naval  Training  Schools  demonstrated 
that  measures  of  learning  were  related  to  measures  of  success  in  schools  teach¬ 
ing  mechanical-motor  skills.  Also,  Allison  (1956)  found  better  predictions  of 
success  from  a  combination  of  the  learning  measures  and  the  Navy  Basic  Test 
Battery  than  from  the  Basic  Test  Battery  alone. 

Wardrop  (1967)  utilized  a  programed  instruction  unit  as  a  complex  minia¬ 
ture  learning  situation  to  predict  classroom  success.  In  the  first  of  two 
studies,  Wardrop  compared  the  predictive  validity  of  traditional  psychological 
tests  with  performance  on  an  85  minute  programmed  text  on  "Bow  to  Study"  using 
training  success  in  Naval  Electronics  and  Mechanical  Schools  as  a  criterion. 
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In  addition,  two  perceptual  and  psychomotor  learning  tasks  were  employed; 
the  DuBois-Bunch  Learning  test  and  a  lumbers  Test  (Hackett,  196k).  Pre- tests 
and  poet -tests  were  used  to  obtain  a  residual  gain  neasure  of  learning  on 
both  the  predictor  and  criterion  tasks.  Results  of  this  study  Indicated 
that  the  programed  learning  residual  gain  measure  was  approximately  equal  to 
the  traditional  psychological  test  (Navy  General  Classification  Test)  in  the 
prediction  of  classroom  performance;  r  =  2.1  vs.  r  =  .28  for  the  electronics 
school  and  r  =  .23  vs.  r  =  .30  for  the  mechanical  school.  Although  the  pro¬ 
grammed  Instruction  learning  measure  and  the  abilities  measure  showed  some 
overlap,  the  independent  component  of  the  programmed  learning  measure  when 
combined  with  the  ability  measure  resulted  in  a  somewhat  higher  validity 
coefficient  (Wardrop,  1967). 

The  subjects  in  Wardrop' s  second  study  were  students  in  an  introductory 
psychology  class.  Two  programmed  learning  tasks  were  utilized;  a  study  skills 
program  and  a  binary  numbers  program.  Tests  of  verbal  ability  and  numerical 
ability  were  the  traditional  psychological  tests  employed.  Neither  the  tradi¬ 
tional  tests  nor  the  residual  gain  measures  correlated  significantly  with 
final  course  grades.  The  only  significant  correlations  with  the  criteria 
were  the  post-test  scores  on  the  binary  numbers  program  and  a  combined  post¬ 
test  score  (binary  lumbers  post-test  score  plus  study  skills  program  post-test 
score).  Wardrop  concluded  that  the  results  of  these  studies,  vhlle  certainly 
not  definitive,  indicated  that  "learning  tests"  are  as  valid  as  traditional 
measures  of  Intelligence  in  the  prediction  of  classroom  performance. 

In  the  development  of  learning  measures  as  predictors,  one  Important  factor 
which  has  not  been  systematically  explored  is  the  analysis  of  the  criterion  task. 
The  situation  is  analogous  to  the  development  of  an  ability  test  battery  to 
predict  Job  performance  without  first  conducting  some  type  of  Job  analysis. 

One  reason  that  learning  tasks  have  not  consistently  predicted  criterion  per¬ 
formance  may  be  that  researchers  have  primarily  employed  perceptual  or  psycho- 
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motor  skills  as  predictors  (Wardrop,  1967)*  It  is  likely  that  the  more  the 
predictive  situation  is  similar  to  the  aritarlonsituatlon,  the  higher  the  resul¬ 
tant  validity  coefficients.  In  other  words,  the  more  similar  the  skills  required 
in  the  predictor  situation  are  to  the  skills  required  in  the  criterion,  the 
more  valid  the  predictor.  If  an  individual's  learning  is  highly  variable 
across  different  types  of  materials  and  across  different  learning  situations 
(Jenkins,  1967),  then  systematic  analysis  of  the  criterion  is  especially  Im¬ 
portant  in  the  development  of  learning  tasks  as  predictors. 

An  analysis  of  the  criterion  task  in  the  present  study  employed  Gagne's 
(1965)  eight  categories  of  learning  which  have  been  defined  in  tens  of  the 
different  sets  of  conditions  necessary  for  learning.  The  learning  tasK  was 
selected  to  represent  the  types  of  learning  utilized  in  the  criterion  task. 

In  summary,  current  psychological  tests  may  be  poor  predictors  of  minor¬ 
ity  group  performance  because  the  variations  in  the  conditions  of  learning  have 
not  been  taken  into  account.  The  present  study  employed  a  measure  of  the  sub¬ 
ject's  performance  on  a  miniature  learning  task  derived  directly  from  the  cri¬ 
terion  task  as  a  predictor  of  his  criterion  task  performance.  Moreover,  the 
conditions  of  learning  are  more  nearly  equalized  for  the  two  ethnic 
groups  than  the  previous  learning  required  for  performance  on  aptitude  tests 
in  an  attempt  to  control  the  bias  present  in  traditional  psychological  tests. 


Subjects 

Ninety-four  freshmen  and  sophomore  students  (46  vhite  and  46  Negro )  at 
the  University  of  Maryland  participated  in  the  five  hour  experiment.  All  sub¬ 
jects  were  obtained  through  ads  in  the  campus  newspaper  and  were  paid  ten 
dollars  for  their  participation.  There  were  two  experimental  sessions,  a 
two  hour  and  a  three  hour  session,  run  on  consecutive  evenings. 


The  Criterion  Task 

The  greatest  problem  encountered  by  researchers  validating  tests  for  Negro 
and  white  groups  is  obtaining  racially  mixed  groups.  Frequently,  Integrated 
classrooms  contain  only  a  few  Negroes.  Combining  classrooms  or  sections  to 
obtain  an  adequate  sample  of  minority  group  members  only  complicates  the  prob¬ 
lem  since  there  is  no  way  to  assess  the  equality  of  the  various  instructor's 
grading  and  Instructional  systems.  In  an  attempt  to  overcome  these  difficul¬ 
ties,  the  present  study  utilized  a  programed  Instruction  unit  as  a  criterion 
task.  Since  the  programmed  instruction  unit  attempts  to  equate  the  instruc¬ 
tion  for  all  subjects,  students  from  different  classrooms  can  be  combined  to 
obtain  an  adequate  sample  of  minority  group  members.  Moreover,  research  re¬ 
sults  indicate  that  learning  accomplished  through  programmed  Instruction  is  at 
least  as  effective  as  "conventional"  methods  of  instruction  (Stolurov,  1961). 

Chapter  four  of  J.  T.  Gibson's  (1968)  programmed  instruction  teat  indus¬ 
trial  Psychology  was  selected  as  the  criterion  task.  This  self-contained  pro¬ 
grammed  instruction  unit  covers  the  following  topics:  averages,  the  normal 
curve,  percentiles,  interquartile  and  semi-interquartile  ranges,  and  the  stand¬ 
ard  deviation.  Sample  frames  are  presented  in  Table  50 

The  128  frames  of  this  unit  were  divided  into  three  (3f"  by  &£")  booklets 
of  approximately  equal  length.  Each  frame  vas  presented  on  a  separate  page. 

The  subject  responded  by  writing  his  answer  on  a  separate  answer  sheet.  The 
subject's  response  consisted  of  either:  (l)  constructing  an  answer,  (2)  filling 
in  a  missing  vord,  or  (3)  selecting  the  correct  choice  from  several  alternatives. 
Immediately  following  each  question  frame  was  another  frame  containing  the 
correct  auswer.  No  time  limit  vas  imposed  on  the  subject. 

Three  measures  of  criterion  performance  were  obtained.  The  first  vas 
the  number  of  errors  in  each  of  the  three  instruction  booklets.  Also,  a  final 
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AVERAGES 


Table  501  Saaple  Trm ms  -  Criterion  Task 


1  Psychologists  often  study  behavior  in  order  to  generalize  about  features  that 
can  be  piedictcd  or  arc  typical  of  groups  of  people  Sometimes  a  group  of 
subjects  is  measured  for  some  characteristic  of  behavior,  such  as  time  for 
running  the  too-vard  dash.  The  performance  of  the  group  as  a  whole  can 
be  described  by  an  average  score. 

Averages  arc  used  to  (choose  one): 

a.  describe  individual  behavior. 

b.  make  generalizations  of  what  can  be  expected  of  groups  of  individuals. 


2  What  docs  average  mean?  What  is  the  average  individual  like?  The  term 
average  has,  in  fact,  a  variety  of  meanings.  It  has  been  used  to  describe 
C choose  one): 

a.  the  typical  individual. 

b.  the  individual  who  appears  most  often  in  the  group. 

c.  an  individual  whose  measured  characteristics  cause  him  to  fall  descrip¬ 
tively  in  the  middle  of  the  group. 

d.  all  of  these. 


3  Three  measures  arc  commonly  used  to  determine  an  average:  the  arithmetic 
mean,  the  midscorc  or  median,  and  the  mode.  Each  measure  means  some 
thing  different  and  is  determined  by  different  statistical  methods.  Thus  it 
follows  that  two  separate  measures: 

a.  often  give  different  estimates  of  average. 

b.  always  give  identical  estimates  of  average. 
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60  Another  property  identifying  the  normal  distribution  is  that  a  particular 
known  percentage  of  the  scores  falls  under  each  part  of  the  curve.  If  a 
perpendicular  line  were  drawn  from  the  baseline  to  the  two  points 
where  the  curve  changes  slope  from  convex  to  concave,  then  68%  of  the 
fetal  area  under  the  curve  would  be  cut  off  as  follows: 


Low  High 

Scores 


The  shaded  area  between  the  two  solid  vertical  lines,  drawn  from  the 
points  where  the  curve  changes  from  convex  to  concave,  represents 
-  %  of  the  cases  in  the  distribution. 
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level  of  achievement  was  measured  by  a  30  item  multiple-choice  test  (see 
Appendix  A).  All  questions  were  designed  to  measure  applications  of  concepts 
and  principles  to  new  situations  and  did  not  duplicate  frames  in  the  pi  n(n  n— ml 
instruction  text*  The  multiple  choice  test  vas  also  administered  prior  to  the 
programed  instruction  task  to  obtain  the  third  measure  of  criterion  perform¬ 
ance  -  a  measure  of  gain  in  proficiency.  Manning  and  Oufiois  (1958)  have  sug¬ 
gested  that  where:  (l)  trainees  differ  in  their  initial  level  of  performance, 
that  is,  their  prior  education  and  experience  has  lad  to  diversity  in  pre-train¬ 
ing  proficiency,  and  (2)  the  training  curriculun  does  not  ordinarily  result 
in  mastery  of  the  job,  but  rather  develops  skills  fundamental  to  efficient 
learning  on-the-job,  final  grades  may  not  adequately  reflect  the  performance 
of  the  trainees.  In  this  situation,  improvement  or  gain-ln-proficiency  is 

considered  to  be  the  most  significant  dimension  of  student  performance. 

✓ 

The  most  widely  used  measure  of  gain- in-proficiency  is  simply  the  differ¬ 
ence  between  final  and  initial  performance.  This  measure  of  change  has  been 
labeled  "crude  gain".  However,  many  researchers  (Manning  and  DuBois,  1962; 
Carver  and  DuBois,  1967;  Wardrop,  1967;  Cronbach  and  Fur by,  1970)  have  indic¬ 
ated  that  "crude  gain"  scores  are  noted  for  their  unreliability.  In  addition, 
crude  gain  scores  and  initial  performance  scores  are  usually  negatively  corre¬ 
lated,  resulting  in  an  over-prediction  of  learning  for  subjects  with  low 
initial  scores  and  an  unda?-prediction  of  learning  for  subjects  with  high  Initial 
scores.  In  an  attempt  to  compensate  for  the  deficiencies  of  the  "crude  gain" 
measure,  Manning  and  DuBois  (1962)  have  recommended  the  use  of  the  "residual 
gain"  measure.  If  7q  is  the  final  score  and  Z  i  the  initial  measure  (both 

expressed  in  Z  score  units),  Z^  can  be  divided  into  two  uncorrelated  parts: 

A 

^2  which  correlates  perfectly  with  Zj_,  and  Z2.I  which  is  uncorrelated  with  Z].. 
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Thus, 

residual  gain  *  -  Z^-  r^i  *  Z2.l 

Residual  gain  is  the  deviation  of  final  performance  scores  from  the  regression 
line  of  final  or  Initial  performance  scores. 

Manning  and  DuBois  (1962)  have  suggested  that  the  use  of  residual  gain 
scores  as  a  criterion  for  the  validation  of  selection  tests  would  serve  to 
facilitate  selection  procedures  oriented  toward  criteria  of  tra inability  or 
educability,  rather  than  achievement  at  a  particular  point  in  learning.  Since 
there  may  be  a  difference  in  the  level  of  performance  (both  pre  and  post)  for 
majority  and  minority  group  members,  this  type  of  criteria  measure  seems  rele¬ 
vant  for  the  validation  of  tests  for  minority  group  members.  These  mesfeers 
may  be  equally  "trainable",  as  compared  to  their  white  counterparts,  when 
measured  by  amount  of  gain  in  proficiency  rather  than  final  level  of  achieve¬ 
ment. 

Predictors 

The  experimental  learning  tasks  utilized  as  predictors  were  also  selected 
from  an  analysis  of  the  criterion  task  using  Gagne's  (1965)  eight  categories 
of  learning.  This  analysis  revealed  that  the  most  frequent  types  of  learning 
represented  in  the  criterion  task  were  concept  and  principle  learning. 

Three  different  types  of  learning  tasks  were  used:  a  paired -associate 
task,  a  concept  learning  task,  and  a  principle  learning  task.  All  the  learn¬ 
ing  tasks  were  presented  in  a  programed  instruction  format  similar  to  the  cri¬ 
terion  task.  Since  previous  researchers  indicated  that  paired-associate  learn¬ 
ing  predicted  academic  success,  the  paired -associate  task  was  included  even 
though  the  analysis  of  criterion  task  indicated  that  this  type  of  learning  was 
not  frequently  represented  in  the  criterion  task.  The  paired-associate  task 
was  a  modification  of  the  task  used  by  Duncanaon  (1966)  in  his  study  of  learn¬ 
ing  and  measured  abilities.  The  task  consisted  of  ten  paired-associate  items. 
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The  ten  response  terms  were  selected  from  Noble's  (1961)  list  of  high  M  value 
nouns.  The  stimulus  terms  were  nonsense  syllables  having  a  kj%  association 
value.  The  actual  pairs  were:  YAT- JEWEL,  TIS-DINNER,  ZUG -MONEY,  R34- VILLAGE, 

SOZ- INSECT,  ^JN-GARMENT,  XOW- HEAVEN,  NO L- WAGON,  RUH-OFFICE,  BEK- KITCHEN.  Each 
stimulus  term  vas  typed  on  a  separate  frame.  The  subject  then  responded  by 
writing  his  answer  on  a  separate  answer  sheet.  Immediately  following  each 
stimulus  frame  was  a  frame  which  contained  both  the  stimulus  and  response  terms. 

The  10  pairs  of  words  were  randomized  within  each  of  the  eight  trials. 

The  concept  task  was  a  modification  of  a  concept  learning  task  developed 
by  Allison  (i960).  The  task  consisted  of  16  sets  of  four  words  with  each  set 
being  assigned  to  one  of  the  code  letters  A,  B,  C,  or  D.  Four  sets  of  words 
belong  to  each  letter,  with  the  code  letter  representing  an  underlying  concept. 

The  following  concepts  were  employed:  concept  A  was  that  one  of  the  four  words 
was  a  number;  concept  B  was  that  one  of  the  four  words  was  a  sport;  concept  C 
was  that  two  of  the  four  words  were  homonyms;  concept  D  was  that  all  four  words 
were  units  of  measurement. 

Again  a  programmed  instruction  format  was  utilized  in  presenting  the  task. 

Each  set  of  words  was  typed  on  a  separate  frame.  The  subject  responded  by 
writing  his  answer  on  a  separate  answer  sheet.  Immediately  following  each  set 
of  words,  a  frame  was  presented  which  contained  the  set  and  the  correct  clas¬ 
sification  letter.  No  time  limit  was  imposed  on  the  subject.  The  order  of 
presentation  of  the  sets  of  words  was  randomized  as  well  as  the  spatial  order 
of  the  words  within  each  set.  Each  of  the  16  sets  of  words  was  presented  8 
times.  Table  51  presents  four  of  the  set3  of  words  used. 

The  principle -learning  task  was  adopted  from  Schoer’s  programmed  text.  An 
Introduction  to  Statistics  and  Measurement.  (1966)  Forty-two  frames  were  selected 
on  the  topic  of  nominal  and  ordinal  measurement.  Sample  frames  are  presented 
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Table  51:  Sample  Frames  -  Concept  Task 


I 


SLOP  SEVEN 

LIFT  MASK 

A 


TRACK  COOK 

TRAIL  HOCKEY 

B 


MIGHT  NIGHT 

PURPLE  MITE 
C 


INCH  MILE 

YARD  FOOT 

D 
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in  Table  52.  The  programed  instruction  format  of  this  task  vas  the  same  as 
the  one  used  in  the  criterion  task.  A  20  item  test  in  vhich  the  student  vas 
required  to  identify  whether  a  measure  vas  nominal  or  interval  vas  developed 
to  measure  final  level  of  achievement  as  well  as  gain  in  proficiency  (see 
Appendix  B). 

The  traditional  psychological  tests  used  as  predictors  were  the  Wonderllc 
Personnel  Test,  and  French’s  (1963)  Wide  Range  Vocabulary  (V-3)  and  Addition 
Test  (N-l).  Also  included  as  a  predictor  vas  a  Digit  Span  Test  developed  by 
the  Navy  Personnel  Research  Activity,  San  Diego,  vhich  yielded  three  scores; 
total  nunber  of  digits  correct,  length  of  span  vhere  first  error  occurred,  and 
longest  correct  span.  Both  the  instructions  and  the  problems  of  the  Digit 
Span  Test  vere  administered  by  tape  recorder.  The  test  involved  immediate 
memory  -  writing  digits  after  a  single  hearing.  The  spans  of  digits  ranged 
from  4  to  10  digits.  Jensen  (1968)  has  indicated  that  this  type  of  test  vas 
more  effective  than  conventional  intelligence  tests  in  predicting  school  grades 
for  individuals  from  low  socioeconomic  backgrounds. 

Predictor  Performance 

Table  53  presents  the  means,  standard  deviations,  and  tests  of  significance 
of  mean  differences  for  the  tvo  racial  groups  on  the  predictor  variables.  On 
the  experimental  learning  measures,  there  vere  no  differences  in  the  mean  per¬ 
formance  of  the  tvo  ethnic  groups  on  the  initial  trial  or  pre-test  measures. 
Although  there  vere  no  differences  on  the  final  trial  scores  of  the  paired- 
associate  task,  vhite  students  obtained  higher  scores  on  the  final  level  of 
achievement  measures  on  both  the  concept  and  principle  learning  tasks. 

The  mean  residualized  gain  scores  for  the  tvo  racial  groups  on  the  experi¬ 
mental  learning  measures  are  also  presented  in  Table  53*  There  vas  no  signific¬ 
ant  difference  in  the  amount  of  gain  on  the  paired-associate  task  for  the  tvo 
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Table  33:  Predictors  Means,  Standard  Deviations,  IPs  and 


Tests  of  Significance  of  Mean  Differences  - 


Learning  Measure  Study 


Predictor 

| 

Group 

X 

8 

N 

(1) 

Paired  -  Associate  Task 

First  Trial 

Total 

.17 

•  77 

9k 

White 

.02 

.15 

k6 

Negro 

•  31 

1.06 

k8 

1.86 

Final  Trial 

Total 

7.38 

2.83 

9k 

White 

6.83 

2.89 

k6 

Negro 

7.92 

2.69 

k8 

1.87 

Concept  Task 

First  Trial 

Total 

8.56 

3.06 

9k 

White 

9-09 

2.9k 

k6 

Negro 

8.06 

3.12 

k8 

1.63 

Final  Trial 

Total 

13-73 

3.09 

9k 

White 

lk.  70 

1.55 

k6 

Negro 

12.81 

3.85 

k8 

3.11** 

Principle  Learning  Task 


Pre-test 

Total 

20.70 

5-11 

9k 

White 

21.70 

9.05 

k6 

Negro 

19.79 

5.0k 

k8 

1.83 

Post-test 

Total 

28.39 

6.9k 

9k 

White 

30.98 

9.93 

k6 

Negro 

2S.83 

6.9k 

ke 

1 

3.82** 

(l)  t  ratios  are  between  the  means  of  the  white  and  Negro  samples 


«-* 


P  <  *03 
P  <  -01 


Table  53  (Contd) 


Predictor 

Group 

X 

8 

IT 

t 

Residual  Gain  Scores 

! 

I  ( 

1  f 

i  1 

1  1 

♦ 

Paired -associate  task 

Total 

.00 

1.00 

94 

White 

-.19 

1.02 

46 

Negro 

.18 

•  95 

48 

1.80 

Concept  task 

Total 

.00 

.96 

94 

White 

•25 

•  50 

46 

Negro 

-.24 

1.18 

48 

2.67* 

Principle  task 

Total 

.00 

.96 

94 

White 

•32 

.86 

46 

Negro 

-.31 

.95 

48 

3.33** 

Residual  Gain  Scores 
(using  subgroup  correlations) 


Paired -associate  test 

Total 

-.03 

1.00 

| 

94  | 

White 

-.25 

1.00 

46 

Negro 

.18 

•  95 

48 

2.11* 

Concept  task 

Total 

.00 

.94 

94 

White 

.25 

•  50 

46 

Negro 

-.25 

1.18 

48 

2.67* 

Principle  task 

Total 

.02 

1.00 

94 

White 

•  35 

.85 

46 

Negro 

-•31 

•  95 

48 

3.51** 

Wonder lie 

Total 

24.90 

5.76 

94 

White 

28.72 

4.04 

46 

Negro 

21.25 

4.71 

48 

8.15** 

Vocabulary  Test 

Total 

23.44 

8.45 

94 

White 

27.87 

7.09 

46 

Negro 

19.19 

7.54 

48 

5.68** 

Addition  Test 

Total 

36.78 

11.20 

94 

White 

37.48 

10.64 

46 

Negro 

36.10 

11.76 

48 

•  59 

Digit  Span  Test 

13.24 

Total  Digits  Correct 

Total 

122.06 

94 

White 

124.52 

9.80 

46 

Negro 

119-71 

15.61 

48 

1.78 

Length  of  Span 

Total 

6.86 

1.27 

94 

of  First  Error 

White 

6.65 

1.27 

46 

Negro 

7.06 

1.24 

48 

1.67 

Length  of  Longest 

Total 

8.34 

1.02 

94 

Correct  Span 

White 

8.52 

.69 

46 

Negro 

8.17 

1.24 

48 

1.68 
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ethnic  groups.  These  data  agree  with  the  lack  of  significance  found  between 
the  nean  performance  of  the  two  groups  on  both  the  initial  or  final  trial 
measures.  For  both  the  concept  and  principle  learning  tasks,  white  students 
obtained  higher  gain  scores  than  Negro  students.  Since  there  were  no  differ¬ 
ences  in  the  pre-test  measures,  these  data  are  also  reflected  by  the  higher 
scores  of  the  white  subgroup  on  the  final  achievement  measures. 

Since  the  results  of  the  residual  gain  analysis  were  dependent  on  the 
correlation  between  the  pre- and  post-test  measures,  the  use  of  subgroup  corre¬ 
lations  in  the  formats  for  computing  residual  gain  scores  might  produce  dif¬ 
ferent  results.  Table  53  presents  the  results  of  tue  residual  gain  analysis 
using  subgroup  correlations.  The  results  of  this  analysis  were  similar  to  the 
results  obtained  using  total  group  correlations  between  the  pre-  and  post-test 
measures.  The  only  exception  occurred  with  the  paired -associate  task.  The 
residuallzed  gain  scores  computed  using  subgroup  correlations  were  significantly 
different  for  the  two  ethnic  groups  with  the  Negro  group  showing  more  gain  in 
proficiency. 

White  students  scored  significantly  higher  than  Negro  students  on  both 
the  Wonderlic  Personnel  Test  and  the  Wide-Range  Vocabulary  Test.  There  was 
no  difference  between  the  mean  performance  of  the  two  groups  on  the  Addition 
Test  or  any  of  the  scores  on  the  Digit  Span  Test. 

Criterion  Performance 

Mean  criterion  scores  for  the  two  ethnic  groups  are  presented  in  Table  54. 
Negro  students  had  more  errors  on  all  three  booklets  of  the  programmed  instruc¬ 
tion  criterion. 

White  students  obtained  higher  scores  on  the  multiple-choice  criterion 
test  administered  both  before  and  after  the  students  were  exposed  to  the  pro¬ 
grammed  instruction.  Thus,  in  terms  of  final  level  of  achievement  white  students 
scored  higher  than  Negro  students. 


Table  54:  Criteria  -  Means,  Standard  Deviations, 
S'  a,  and  Tests  of  Significance  of  Mean  Differences  - 
Learning  Measure  Study 


f 

1 

Criterion 

Group 

X 

8 

N  I 

Error  Scores 


Booklet  I 

Total 

3.36 

3.10 

94 

White 

2.02 

1.86 

46 

Negro 

4.65 

3.50 

48 

4.53** 

Booklet  II 

Total 

11.57 

8.36 

94 

White 

9.84 

6.02 

46 

Negro 

13.23 

9.90 

48 

2.12* 

Booklet  III 

Total 

13.24 

8.52 

94 

White 

9.83 

5.20 

46 

Negro 

16.52 

9.76 

48 

4.12** 

Pre-test 

Total 

mmm 

3-78 

94 

White 

2.98 

46 

Negro 

3.34 

48 

6.25** 

Post-test 

Total 

17.10 

5.45 

94 

White 

20.07 

5.13 

46 

Negro 

14.25 

3.98 

48 

6.09** 

Residual  Gain  - 

Total 

.00 

.81 

94 

Criterion  Task 

White 

.21 

.90 

46 

Negro 

-.21 

.65 

4P 

2.57* 

Residual  Gain  - 

MM|M 

Criterion  Task 

.04 

.83 

94 

(using  subgroup 

.89 

46 

correlations) 

.64 

48 

3.72** 

(l)  t  ratios  are  between  the  means  of  the  white  and  Negro  samples 
*  P  <  .05 

**  p<  .01 
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Residual  gain  scores  were  computed  for  the  two  groups  using  the  pre-  and 


post-measure.  Inspection  of  Tabled  indicates  that  white  students  not  only 
had  higher  pre-  and  post-test  scores  but  they  -Iso  exhibited  more  gain  in  pro¬ 
ficiency  as  measured  by  the  residual  gain  scores.  A  similar  finding  was  ob¬ 
tained  using  the  residual  gain  scores  developed  using  subgroup  correlations. 

Validity 

Correlations  between  the  predictor  and  criterion  measures  are  presented 
in  Table  55-  Performance  on  the  paired-associate  task  did  not  predict  perform¬ 
ance  on  any  of  the  criterion  measures  for  either  racial  group  separately,  and 
correlated  significantly  with  only  two  measures  for  the  total  group  post-test 
and  residual  gain  scores  using  subgroup  correlations. 

Initial  level  of  performance  on  the  concept  task  correlated  with  error 
scores  on  all  three  booklets  for  both  the  Negro  group  and  the  total  group 
but  did  not  correlate  significantly  for  the  white  group.  Neither  final  level 
of  achievement  nor  residuallzed  gain  was  predictable  using  initial  level  of 
performance  as  the  predictor. 

Pinal  level  of  performance  on  the  concept  task  did  not  correlate  with 
final  level  of  achievement  on  the  criterion  test  for  either  ethnic  group 
separately,  but  did  correlate  with  final  level  of  achievement  for  the  total 
group.  Although  final  level  of  achievement  on  the  concept  task  predicted  error 
scores  on  Booklet  I  for  both  whites  and  Negroes,  it  predicted  errors  on  Book¬ 
lets  II  and  III  only  for  the  total  group.  Residuallzed  gain  scores  were  not  pre¬ 
dictable  using  final  level  of  achievement  on  the  concept  task. 

Initial  level  of  performance  on  the  principle  learning  task  did  not  pre¬ 
dict  any  of  the  criterion  measures  for  the  Negro  group  and  predicted  only  two 
criteria  for  the  total  group  i.e.,  final  level  of  achievement  and  residuallzed 
gain  scores  using  subgroup  correlations.  Final  level  of  achievement  on  the 


Table  55  :  Predictor  -  Criterion  Correlations 
Learning  Measure  Study ^ 

Predictor 


Criterion 

Group 

Paired  Associate  Task 

Concept  Task 

First  Trial  1  Final  Trial 

First  Trial 

Final  Trial 

Error  Scores 


Booklet  1 

Total 

03 

-11 

-32 

-44** 

White 

-08 

-21 

-2k 

-29* 

Negro 

-06 

-25 

-31* 

-37** 

(6) 

(1) 

Booklet  II 

Total 

04 

-13 

-27** 

.30** 

White 

03 

-12 

-14 

-21 

Negro 

-01 

-22 

-31* 

-26 

(6) 

Booklet  III 

Total 

03 

-08 

-26** 

-36** 

White 

01 

-2k 

-01 

-23 

Negro 

-06 

-16 

-32* 

-26 

(6) 

(11) 

Post  Test 

Total 

-29** 

-06 

09 

27** 

White 

-06 

05 

01 

15 

Negro 

-26 

-01 

-01 

15 

Residual  Gain  - 

Total 

-20 

-15 

02 

16 

Criterian  Task 

White 

-05 

-11 

-05 

16 

Negro 

-26 

-11 

-01 

07 

Residual  Gain  - 

Total 

-22* 

-13 

04 

20 

Criterian  Task 

White 

-05 

-05 

-03 

16 

(using  subgroup 
correlations) 

Negro 

-27 

-09 

-01 

09 

(1)  Decimals  are  omitted 

(2)  Number  in  parentheses  below  the  correlation  for  the  Negro  sample 
indicates  the  model  illustrated. 

*  P  < .05 

**  p  <  .01 
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Table  55  (contd) 


Criterion  Principle  Task  Residual  Gain  Scores 


Paired- 

Pretest 

Post test 

Associate 

Concept 

Principle 

Error  Scores 


Booklet  I 

Total 

-04 

-20 

-12 

-35** 

-19 

White 

-05 

-15 

-21 

-13 

-13 

Negro 

10 

-01 

-24 

-31* 

-03 

(8) 

Booklet  II 

Total 

-17 

-19 

-13 

-22* 

-15 

White 

-19 

-07 

-12 

-12 

-01 

Negro 

-11 

-15 

-22 

-19 

-13 

Booklet  III 

Total 

-08 

-19 

-09 

-29** 

-17 

White 

-06 

-09 

-24 

-23 

-07 

Negro 

01 

-04 

-16 

-20 

-04 

Post  Test 

Total 

35** 

51** 

-07 

43** 

White 

38** 

32** 

05 

20 

Negro 

22 

49** 

01 

45** 

(6) 

(1) 

(11) 

(8) 

Residual  Gain  - 

Total 

18 

32** 

-15 

16 

26** 

Criterion  Task 

White 

22 

25 

-11 

19 

18 

Negro 

03 

26 

-09 

07 

27 

(11) 

(11) 

Residual  Gain  - 

Total 

24** 

39** 

-12 

19 

33** 

Criterion  Task 

White 

30* 

29* 

-05 

18 

20 

(using  subgroup 

Negro 

06 

31* 

-07 

09 

31* 

conditions) 

(6) 

(1) 

(8) 

Residual  Gain  Scores 

Criterion  (using  subgroup  correlations) 


I  Paired-associate  1  Concept  i  Principle 


Booklet  I 

Total 

-10 

-20 

White 

-22 

-14 

Negro 

-24 

-31* 

(8) 

-04 

Booklet  II 

Total 

-12 

-22* 

-16 

White 

-12 

-12 

-04 

Negro 

-22 

-19 

-12 

Booklet  III 

Total 

-07 

-29** 

-18 

White 

-24 

-23 

-08 

Negro 

-16 

-21 

(11) 

-04 
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Table  55  (contd) 


Paired -assoc late 

Principle 

Post  Test  - 
Criterlan  task 

Total 

White 

Negro 

-09 

05 

01 

25** 

15 

16 
(11) 

46** 

27 

45** 

(6) 

Residual  Gain  - 

Total 

-15 

16 

30** 

Criterion  Task 

White 

-12 

19 

22 

Negro 

-09 

08 

27 

(11) 

Residual  Gain  - 

Total 

-13 

19 

36** 

Criterlan  task 

White 

-05 

18 

25 

(using  subgroup 
correlations) 

Negro 

-07 

09 

31* 

(8) 

Criterion 


Wonder lie 

HMI 

Addition 

Booklet  I 

Total 

-56** 

-32** 

-23* 

White 

-35* 

-08 

-34* 

Negro 

-44** 

-16 

-18 

(1) 

(11) 

(6) 

Booklet  II 

Total 

-38** 

-12 

-29** 

White 

-44** 

-11 

-31* 

Negro 

-29* 

03 

-27 

(1) 

(6) 

Booklet  III 

Total 

-5b** 

-27** 

-27** 

White 

-66** 

-31* 

-37* 

Negro 

-40** 

02 

-23 

(1) 

(8) 

(6) 

Post  Test 

Total 

63** 

54** 

31** 

White 

43** 

37** 

33** 

Negro 

4<j** 

36* 

35* 

(1) 

(1) 

(3) 

Residual  Gain  - 

1 

3b** 

36** 

24** 

Criterion  Task 

K 

37** 

36** 

21 

14 

17 

27 

mi 

(8) 

(8) 

(11) 

Residual  Gain  - 

Total 

45** 

42** 

28** 

Criterion  Task 

White 

41** 

38** 

27 

(using  subgroup 

Negro 

20 

21 

30* 

correlations) 

(8) 

(8) 

(0) 
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Table  55:  (contd) 


Criterion 

Digit  Span  Test 

Total  Digits 

Span  of 

Longest  Correct 

Error  Scores 

Correct 

First  Error 

Span 

Booklet  I 

Total 

-1+3*+ 

-12 

-1*1** 

White 

-ll* 

-07 

-16 

Negro 

-l*6*+ 

-31* 

-1*1*** 

(6) 

(6) 

(6) 

Booklet  II 

Total 

-27** 

-11 

-31** 

White 

-13 

-13 

-ll* 

Negro 

-26 

-17 

-33* 

(6) 

Booklet  III 

Total 

-31*** 

-10 

-33** 

White 

-16 

-23 

-04 

Negro 

-3U* 

-16 

-36* 

(6)  j 

(6) 

Post  Test 

Total 

16 

-05 

19 

tliite 

12 

07 

16 

Negro 

ou 

01 

11 

Residual  Gain  - 

Total 

12 

01 

14 

Criterion  Task 

White 

21 

12  | 

22 

Negro 

-02 

1 

O 

u> 

03 

Residual  Gain  - 

Total 

13 

-02 

16 

Criterion  Task 

White 

lo 

10 

20 

(using  subgroup 
correlations) 

Negro 

-01 

-03 

05 
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principle  learning  task  predicted  post-test  scores  and  residua. Iliad  gain  scores 
using  subgroup  correlations  for  both  racial  groups  but  did  not  predict  error 
scores  for  either  racial  group.  Residualized  gain  scores  using  total  group 
correlations  were  predictable  only  for  the  total  sample. 

Residualized  gain  scores  on  the  paired-associate  task  did  not  predict 
any  of  the  criteria  for  either  racial  group,  while  these  scores  on  the  concept 
task  predicted  error  rates  on  all  three  booklets,  as  well  as  post-test  scores 
for  the  total  group.  However,  none  of  the  criteria  were  predictable  for  each 
racial  group  separately  with  the  exception  that  error  scores  on  Booklet  I  were 
predictable  for  the  Hegro  sample.  Similar  findings  were  obtained  vith  resid¬ 
ualized  gain  scores  based  on  subgroup  correlations. 

Residualized  gain  scores  on  the  principle  learning  task  did  not  predict 
error  scores  on  the  criterion  for  any  group.  Post- test  scores  and  residualized 
gain  scores  on  the  criterion  task  were  predictable  for  the  total  group.  However, 
Inspection  of  subgroup  correlations  indicates  that  only  the  post-te-t  scores 
and  the  residualized  gain  scores  using  subgroup  correlations  were  predictable 
only  for  the  Hegro  sample. 

The  Wonderlic  Personnel  Test  predicted  error  scores  on  all  these  booklets 
and  final  level  of  achievement  for  both  racial  groups  as  well  as  the  total 
group  while  residual  gain  scores  were  predictable  only  for  the  total  group  and 
the  white  subgroup.  The  Wide-Range  Vocabulary  Test  predicted  final  level  of 
achievement  for  both  racial  groups  but  predicted  residual  gain  only  for  the 
white  and  total  sample.  Error  scores  were  not  predictable  using  the  vocabulary 
test  except  for  Booklet  III  for  the  white  subgroup. 

The  addition  test  predicted  error  scores  on  all  three  booklets  for  the 
white  sample  and  total  group  but  did  not  predict  error  scores  for  the  Hegro 
sample.  Although  final  level  of  achievement  was  predictable  for  both  racial 
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groups,  residual  gain  scores  on  the  criterion  test  were  predictable  only  for 
the  total  group.  Residua lized  gain  scores  using  subgroup  correlations  were 
predictable  only  for  the  total  group  aid  the  Negro  sample. 

Neither  the  post-test  scores  nor  the  resldualized  gain  scores  were  pre¬ 
dictable  using  the  Digit  Span  Test.  However,  the  total  digits  correct  score 
and  the  lowest  correct  span  score  predicted  error  scores  on  all  three  book¬ 
lets  for  the  total  group  and  predicted  errors  on  Booklets  I  and  III  for  the 
Negro  sample.  The  length  of  span  of  first  error  score  correlated  with  only 
one  criterion-error  scores  on  Booklet  I  for  the  Negro  sample. 

To  summarize,  perhaps  the  most  consistent  finding  was  that  performance 
on  the  paired -assoc late  task  did  not  predict  any  of  the  criterion  measures. 

The  concept  task  exhibited  some  validity  in  predicting  errors  for  the  total 
group  and  in  a  few  instances  for  the  Negro  sample.  On  the  other  hand,  the 
principle  learning  task  showed  its  greatest  validity  in  the  prediction  of 
final  level  of  achievement.  In  general,  the  validity  of  the  residual  gain 
measures  was  lower  than  that  of  the  final  level  of  performance  measures  but 
the  general  pattern  of  validity  for  the  two  types  of  measures  was  similar. 

The  Wonderlic  predicted  more  criteria  than  any  other  predictor.  In  fact, 
the  Wonderlic  predicted  all  criterion  measures  for  the  white  sample  and  all 
but  the  gain  measures  for  the  Negro  sample.  The  Vocabulary  and  Addition  Tests 
predicted  final  test  performance  for  both  racial  groups  but  neither  predicted 
errors  for  the  Negro  sample.  The  Digit  Span  test  did  not  predict  post-test 
performance  or  gain  in  proficiency  for  either  racial  group,  but  two  of  the  scores 
exhibited  validity  in  predicting  errors  scores  for  the  Negro  sample. 

Models  Illustrated 

Five  separate  models  were  illustrated  in  this  study.  The  specific  model 
illustrated  in  each  predictor-criterion  relationship  is  enclosed  in  parenthesis 
below  the  correlation  for  the  Negro  sample  in  Table  55,  The  most  frequently 


illustrated  model  was  Model  6,  represented  in  fifteen  cases.  The  relationship 
between  all  scores  of  the  Digit  Span  Test  and  error  scores  on  Booklet  I  are 
clear  illustrations  of  Model  6.  White  students  made  less  errors  than  Kegro 
students  but  there  was  no  difference  in  the  mean  performance  of  the  two  groups 
on  the  Digit  Span  Test.  Moreover,  the  test  was  valid  only  for  the  Kegro  sample. 
Although  this  situation  is  not  likely  to  lead  to  differential  selection  rates, 
it  is  not  an  optimal  selection  strategy  since  the  best  individuals  in  the  white 
sample  are  not  being  selected. 

Model  8,  most  clearly  illustrated  in  the  relationship  between  the  residual 
gain  scores  on  the  principle  learning  task  and  post -test  scores  on  the  criterion 
test,  was  illustrated  in  eleven  predictor-criterion  relationships.  Model  8  is 
illustrative  of  the  situation  where  a  difference  in  predictor  performance  is 
paralleled  by  a  difference  in  criterion  performance,  but  the  test  is  valid  only 
for  one  subgroup.  In  over  half  of  the  illustrations  of  this  model,  the  test 
was  valid  only  for  the  Kegro  sample.  It  is  ironic  that,  even  in  those  situa¬ 
tions  where  the  test  was  valid  solely  for  the  Kegro  sample,  the  use  of  total 
group  validation  procedures  would  result  in  test  bias.  This  occurs  because  the 
Kegro  group  scores  lower  on  the  predictor  and  would  be  less  likely  to  be  selected. 

Model  11,  as  illustrated  in  the  relationship  between  final  level  of  achieve¬ 
ment  on  the  concept  task  and  post -test  scores  on  the  criterion  test,  occurred 
eleven  times.  The  test  vas  not  valid  for  either  ethnic  group  separately  but 
showed  validity  for  the  total  group.  The  validity  in  this  situation  results 
from  the  fact  that  the  two  groups  differ  in  both  predictor  and  criterion  per¬ 
formance.  Failure  to  check  the  validity  in  the  two  groups  separately  would  re* 
suit  in  inadvertent  racial  discrimination. 

Model  1  vas  illustrated  in  eight  of  the  predictor-criterion  relationships. 

The  relationship  between  the  Wonderlic  and  the  criterion  post-test  clearly 
illustrates  Model  1.  White  students  scored  higher  on  both  the  Wonderlic  and 


the  criterion  post-test.  The  test  vas  valid  for  both  racial  groups.  For 
this  type  of  relationship  the  difference  in  predictor  performance  does  not 
represent  test  bias,  since  the  predictor  reflects  a  corresponding  difference 
in  criteria  performance. 

The  final  model  illustrated  in  this  sample  vas  Model  3*  The  mean  perform¬ 
ance  of  the  two  ethnic  groups  vas  approx imately  equal  on  the  Addition  test  but 
vhite  students  obtained  higher  scores  on  the  criterion  post-test.  The  test  vas 
valid  for  both  racial  groups.  If  a  total  group  validation  procedure  vas  utilised 
this  relationship  vouM  result  in  over- predict ion  of  criterion  performance  for 
the  Negro  sample. 

Discussion 

Results  of  the  present,  study  tend  to  support  the  earlier  findings  of 
Jensen  (1968)  and  Selmer  and  Iscoe  (1963)  indicating  that  there  is  no  differ¬ 
ence  betveen  vhites  and  Negroes  in  the  learning  of  paired-associate  tasks. 
Bovever,  in  the  present  study,  performance  on  the  paired -associate  task  vas 
not  related  to  performance  on  the  programmed  instruction  criterion  for  either 
racial  group. 

Although  measures  of  acquisition  in  paired-associate  tasks  frequently  shov 
substantial  correlations  with  measures  of  academic  performance,  there  have  been 
a  number  of  studies  vhich  nave  failed  to  shov  such  a  relationship  (Travers, 

1967).  The  failure  of  paired -associate  tasks  to  consistently  predict  class¬ 
room  learning  may  result  from  tne  factorial  complexity  of  the  classroom  situa¬ 
tion.  In  some  learning  situations  paired-associate  learning  may  be  an  important 
skill (i^.,  foreign  language  vocabulary  learning)  vhile  in  other  situations 
this  type  of  learning  may  not  be  important  (i.e.,  solving  mathematical  equations) 
The  more  similar  the  skills  required  in  the  predictor  task  are  to  those  required 
in  the  criterion  ta3k,  the  more  valid  the  predictor  should  be.  The  analysis  of 


the  criterion  task  in  the  present  study  indicated  that  paired -associate  learn¬ 
ing  was  not  frequently  represented  in  the  criterion.  Thus,  the  failure  of  the 
paired-associate  task  to  predict  criterion  performance  in  the  present  study 
may  result  from  the  fact  that  paired-associate  learning  conditions  vere  not 
present  in  the  criterion  task. 

Viewed  in  terms  of  mean  performance  levels,  the  use  of  the  concept  learn¬ 
ing  task  as  a  predictor  did  not  eliminate  differences  in  mean  predictor  per¬ 
formance  for  the  two  racial  groups.  In  fact,  although  there  was  no  difference 
in  the  initial  level  of  performance  for  the  two  groups,  white  students  obtained 
a  higher  level  of  final  achievement.  Also,  the  residual  gain  score  analysis 
revealed  that  white  students  shoved  more  gain  in  proficiency  than  the  Negro 
students. 

The  results  of  the  residual  gains  analysis  are  in  agreement  vlth  the  find¬ 
ings  of  Stabler  and  Perry  (1967).  They  compared  the  performance  of  white  and 
Negro  college  students  on  a  section  of  Bollard  and  Skinner's  programmed  text, 

The  Analysis  of  Behavior  .  Despite  the  fact  that  the  Negro  and  vhite  students 
vere  matched  on  the  basis  of  sex,  age,  IQ,  and  pretest  scores,  vhite  students 
obtained  higher  scores  on  the  post-test. 

The  concept  learning  task  did  exhibit  some  degree  of  validity  for  the  Negro 
sample  in  predicting  error  scores  but  did  not  predict  either  final  level  of 
achievement  or  gain  in  proficiency  for  either  racial  group. 

Since  the  analysis  of  the  criterion  task  revealed  mainly  principle  learn¬ 
ing  conditions,  it  was  predicted  that  the  principle  learning  task  would  be  the 
most  valid  predictor.  This  prediction  vas  not  supported  by  the  data  since  the 
Wonder lie  test  predicted  more  criteria  than  any  other  predictor.  Bovever,  final 
level  of  achievement  on  the  principle  learning  task  predicted  final  level  of 
achievement  on  the  criterion  test  and  residual  gain  scores  using  subgroup  cor¬ 
relations  for  both  racial  groups.  Moreover,  residual  gain  scores  on  the 
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princple  learning  task  predicted  final  level  of  achievement  and  residual 
gain  scores  using  subgroup  correlations  for  the  Negro  sample.  In  verms  of 
mean  performance  level,  the  use  of  the  principle  learning  task  did  not 
eliminate  racial  differences  as  white  students  obtained  higher  post -test 
scores  as  veil  as  larger  residual  gain  scores. 

As  a  result  of  this  study,  three  general  conclusions  concerning  learn¬ 
ing  measures  emerge.  First,  there  is  little  difference  in  the  learning  abil¬ 
ity  of  vbite  and  Negro  students  on  relatively  simple  learning  tasks  such  as 
the  paired -associate  task.  However,  performance  on  this  relatively  simple 
task  may  not  predict  performance  in  complex  learning  situations  such  as  the 
programmed  instruction  criterion  utilized  in  the  present  study.  Second, 
with  more  complex  learning  tasks  such  as  the  concept  and  principle  learning 
tasks,  white  students  obtain  higher  levels  of  achievement  and  also  exhibit  a 
larger  gain  in  proficiency  than  their  Negro  counterparts.  Thus,  it  is  unlikely 
that  the  use  of  such  measures  as  predictors  will  eliminate  mean  differences 
in  predictor  performance  for  white  and  Negro  subjects.  It  should  be  noted 
however,  that  it  is  not  always  desirable  to  reduce  mean  differences  in  pre¬ 
dictor  performance  if  there  are  corresponding  differences  in  criterion  perform¬ 
ance.  Third,  the  results  of  the  present  study  lend  partial  support  to  earlier 
studies  which  indicate  that  it  is  feasible  to  use  miniature  learning  situa¬ 
tions  to  predict  academic  performance.  In  fact,  inspection  of  Table  55, 
indicates  that,  at  least  with  some  criteria,  the  learning  measures  are  more 
predictive  of  criterion  performance  for  Negro  students  than  for  white  students. 
This  result  is  somewhat  encouraging  since  O'Leary,  Farr,  and  Bartlett  (1970) 
found  a  large  number  of  situations  in  which  traditional  tests  were  valid  for 
whites  but  not  for  Negroes.  Perhaps  miniature  learning  situations  can  be 
used  as  predictors  for  Negroes  in  such  situations. 
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The  traditional  psychological  tests  predicted  final  level  of  achievement 
for  both  racial  groups.  Errors  scores  were  predictable  for  both  racial  groups 


using  the  Wonderllc  as  the  predictor  but  were  predictable  only  for  the  white 
sample  using  the  Addition  Test.  The  Wonder  lie  and  the  Vocabulary  Test  pre¬ 
dicted  gain  scores  for  white  students  while  the  Addition  Test  predicted  gain 
scores  for  the  Negro  group.  The  Wonderlic  Test  predicted  more  criteria  than 
any  of  the  other  predictors. 

Although  there  was  no  difference  in  the  mean  performance  of  the  two  racial 
groups  on  the  Digit  Span  Test,  the  test  did  not  predict  either  final  level  of 
achievement  or  gain  in  proficiency  for  either  racial  group.  These  results  tend 
to  be  somewhat  at  variance  with  results  reported  by  Jensen  (1968)  indicating 
that  performance  on  digit  span  tests  was  predictable  of  academic  success. 

Error  scores  were  predictable  for  the  Negro  sample  using  the  subtest  scores  of 
the  Digit  Span  Test. 

In  conclusion,  the  results  of  the  present  study  indicate  that  miniature 
learning  situations  show  some  promise  as  predictors  of  academic  perfo nuance 
for  minority  group  members.  Use  of  the  research  paradigm  utilized  in  this  ex¬ 
ploratory  study  in  an  actual  training  program  seems  warranted. 


The  regression  tests  for  the  analysis  of  covariance  (Pottboff ,  1969) 
are  presented  in  Table  56.  'nils  analysis  indicated  that  there  vas  no  dif¬ 
ference  in  the  slope  of  the  regression  line  for  the  two  racial  groups  in  any 
of  the  predictor-criterion  relationships*  In  approximately  half  of  the 
predictor-criterion  relationships  there  vas  a  difference  in  the  intercepts 
for  the  two  racial  groups. 


U3 


1 

1 

Table  56:  Analysis  of  Covariance  for  Boaogenity  of  Regression 

Learning  Measure  Study ^ 


Error  Scores 


Booklet  1 

10.10** 

.09 

20.32** 

12.54** 

.81 

24.32** 

Booklet  2 

1.89 

.02 

3.78 

3.05 

.82 

5.29* 

Booklet  3 

8.40** 

.01 

16.98** 

9.72** 

.06 

19.57** 

Post  Test 

16.26** 

.06 

32.80** 

17.82** 

.09 

35.90** 

Residual  Gain 
Criterion 

Task 

2.62 

.03 

5.28* 

2.79 

.03 

5.61* 

Residual  Gain 
(Using  sub¬ 
group  Corre 
1st ions) 

5.79** 

.04 

11.66** 

6.24** 

.02 

12.60** 

Criterion 

Concept  Learning  Task 

First  Trial 

Final  Trial 

Fl 

f2 

f3 

Fl 

f2 

'3 

Error  Scores 

Booklet  1 

9.07** 

1.01 

17.22** 

5.89** 

.01 

11.91** 

Booklet  2 

2.13 

1.65 

2.58 

.71 

.03 

1.42 

Booklet  3 

9.18** 

3.58 

14.37** 

5.17** 

.01 

10.45** 

Post  Test 

17.70** 

.01 

35.7  ** 

15.00** 

.58 

29.56** 

Residual  Gain 
Criterion  Taa 

» 

;  3.^3 

.09 

6.85* 

2.98 

.96 

5.CT 

Residual  Gain 
(Using  sub¬ 
group  corre 
1st  ions) 

6.82** 

.03 

13.77** 

5.90** 

.96 

18.84** 

146 


Table  56  •'  (Contd) 


Predictor 


Principle  Learning  Task 


Po attest 


Booklet  1 

10.32** 

.58 

20.16** 

8.05** 

.23 

Booklet  2 

l.kk 

.00 

2.98 

1.12 

.29 

Booklet  3 

7.99** 

.07 

16.07** 

6.53* 

.01 

Post  Test 

L6.79** 

1.39 

32.07** 

10*1** 

.01 

Residual  Gain- 
Criterion  Task 

3.33 

1.2k 

5*1* 

1.38 

•30 

Residual  Gain 
(using  sub- 

6.78** 

2.03 

11.39** 

3.39 

•39 

group  corre- 

lationa) 

Predictor 


16.01** 


13.19** 

21.06** 


Criterion 


Wonder lie 


Vocabula 


Error  Scores 


Booklet  1  1.37 

Booklet  2  .19 

Booklet  3  .03 

Post  Test  2.3** 

Residual  Gain- 
CrlteriorlTasl  I.56 

Residual  Gain  2 . 10 
(using  sub¬ 
group  corre¬ 
lations) 


.9** 

5*3** 

.1* 

.38 

1.42 

.29 

.05 

5.1*6** 

1.27 

4.19 

6.91** 

•  35 

.16 

I.50 

2.13 

1.01 

2.69 

1.99 

10.  vr** 


13.57** 


Table  56:  (Contd) 


Predictor 

Criterion 

Addition 

Digit  Span 

Total  Digit  Corrects 

* 1 

*2 

F3 

F1 

*2 

F3 

Error  Scores 


Booklet  1 

9.83** 

.01 

19.87 

9.97** 

3.1* 

16.40** 

Booklet  2 

1.83 

.14 

3.56 

1.46 

.45 

2.49 

Booklet  3 

8.25** 

.01 

16.69** 

7.13** 

.75 

13-5*** 

Post  Test 

Residual  Gain- 

19.20** 

.22 

38.52** 

17.18** 

•39 

34.20** 

Crlterlan  Tasl 

3.19 

.04 

6.41* 

4.08* 

2.15 

5.93* 

Residual  Gain 
(using  sub¬ 
group  corre¬ 
lations) 

6.92** 

.21 

13.75** 

7.10** 

1.46 

12.68** 

Criterion 

Predictor 

Digit  Span 

Length  of  Span  of  First  Error 

Length  of  largest  Correct  Span 

*1 

P2 

*3 

F1 

r2 

F3 

Error  Scores 


Booklet  1 

13.40** 

2.67 

23.70** 

9.15** 

1.48 

16.72** 

Booklet  2 

2.55 

•27 

4.86 

1.46 

.57 

2.37 

Booklet  3 

9.60** 

.06 

19.34** 

7.89** 

1.85 

13.80** 

Post  Test 

18.16** 

.11 

36.56** 

17.06** 

•  53 

33-76** 

Residual  Gain* 
Criterion  Tasl 

3.84 

.63 

7.08* 

3-97* 

2.07 

5.80* 

Residual  Gain 
(using  sub¬ 
group  corre¬ 
lations) 

7.25** 

.44 

14.15** 

7.05** 

1.58 

12.43** 
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Table  56:  (Contd) 


Criterion 


Predictor 


Residual  Gain  Scores 


Paired -assoc late  Task 

n 

B 

B 

Q 

B 

* 1 

*2 

f3 

Error  Scores 


Booklet  1 

12.45** 

.79 

2k. 16** 

7-16** 

.20 

14.25** 

8.12** 

16.34** 

Booklet  2 

3.03 

.62 

5-24 

1.11 

.00 

2.34 

1.48 

.42 

2.56 

Booklet  3 

9.66** 

.06 

19.46** 

6.13** 

.09 

12.29** 

6.84** 

.00 

13.63** 

Post  Test 

17.91** 

.06 

36.14** 

15.56** 

.50 

30.79** 

12.51** 

.48 

24.69** 

Residual  Gain- 

Criterion  Task 

2.85 

•05 

5.71* 

3.33 

1.49 

5.14 

1.13 

.00 

3-30 

Residual  Gain 
(using  sub¬ 
group  corre¬ 
lations) 

6.30** 

.00 

12.75** 

6.24** 

1.23 

11.25** 

3.99* 

.00 

8.07** 

Criterion 


Predictor 


Residual  Gain  Scores  -  (using  subgroup  correlations 


Paired-assoc: 

Late  Task 

Principle 

Task 

B 

B 

B 

B 

B 

B 

fl 

r2 

p3 

Error  Scores 


Booklet  1 

12.62** 

.68 

24.64** 

7.13** 

.22 

14.17** 

7.94** 

.07 

15.97** 

Booklet  2 

3-10 

.78 

5.44** 

1.09 

.01 

2.20 

1-33 

.26 

2.41 

Booklet  3 

9.77** 

.04 

19-71** 

6.08** 

.08 

12.21** 

6.71 

.01 

13.56** 

Post  Test 

17.72** 

.04 

37.77** 

15.54** 

.50 

30.74** 

11.66** 

.07 

23.49** 

Residual  Gain- 
Criterion  Task 

2.74 

.07 

5.46* 

3.32 

1.49 

5.12 

1.50 

.09 

2.94 

Residual  Gain 
(using  sub¬ 
group  corre¬ 
lations) 

6.15** 

.00 

i 

12.43** 

i 

6.23** 

1.23 

11.21** 

3-71 

.10 

7.40** 

7o5 **  p  <  .01 


(1)  Degrees  of  freedom  for  all  comparisons:  F-(2,90);  F2-(l,90); 

(2)  F^  tests  hypothesis  that  F  (YjJXj^)  =  a  ♦  bXy  for  all  1  groups. 

(3)  Fg  tests  hypothesis  that  F  (YjjJx^j)  ■  aj  ♦  bX^j  fOT  *41  *  groups. 

(4)  F^  tests  hypothesis  that  F  ( )  *  a  ♦  blxlj  for  *41  *  groups. 

(▼slid  test  only  if  F2  is^jiot  significant) 


and  Conclusion* 


The  task  of  Integrating  the  aassive  aaount  of  data  ex sained  is  a  difficult 
one.  The  data  were  collected  in  such  a  vide  variety  of  settings  that  the  uncon¬ 
trolled  and  often  unknown  situational  variables  have  probably  had  far  greater 
effects  on  the  results  than  the  measures  included  for  study.  The  sample  sizes 
have  varied  widely  from  study  to  study  and  from  subgroup  to  subgroup.  In  order 
to  account  for  the  varying  sample  size  emphasis  in  interpretation  has  been  placed 
on  statistical  significance  rather  than  absolute  magnitude  of  validity  coeffic¬ 
ients.  This  decision  was  made  on  the  basis  of  an  assumption  that  no  validity 
can  be  considered  of  great  enough  magnitude  to  be  useful  in  actual  personnel 
selection  if  its  difference  from  zero  can  be  reasonably  attributed  to  chance. 

Interpretation  of  potential  bias  in  a  selection  Instrument  was  made  accord¬ 
ing  to  the  Bart Lett -O' Leary  differential  prediction  models  (1969)  and  the  defin¬ 
ition  of  test  bias  of  Cleary  (1966,  p.  l):  "A  test  is  biased  for  members  of  a 
subgroup  of  the  population  if,  in  the  prediction  of  a  criterion  for  which  the 
test  was  designed,  consistent  nonzero  errors  of  prediction  are  made  for  members 
of  the  subgroup.  In  other  words,  the  test  is  biased  if  too  high  or  too  low  a 
criterion  score  is  consistently  predicted  for  members  of  the  subgroup  when  the 
comon  regression  line  is  used."  The  presence  of  any  of  the  Bartlett -O' Leary 
models  would  then  indicate  bias  in  the  prediction  of  performance  by  using  the 
measure  for  selection,  the  exception  being  Model  1.  Model  1,  where  the  measure 
is  valid  for  both  groups  and  differences  in  performance  on  both  the  predictor  and 
the  criterion  are  in  the  same  direction,  may  result  in  identical  regression  lines 
for  both  subgroups,  with  accurately  predicted  performance  differences  resulting 
for  bot'i  subgroups.  The  selection  situation  described  vuider  Model  1  cannot  be 
considered  resulting  In  unfair  bias  since  differential  selection  from  the  sub¬ 
groups  vould  be  considered  a  valid  prediction  of  success. 
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Further  assumed  In  all  the  Interpretation*  of  Mas  is  that  the  criterion  is 
a  valid  Measure  of  performance.  Implicit  in  this  assumption  is  that  the  criterion 
measure  itself  is  unbiased.  This  assumption  must  be  subjected  to  serious  question. 
Caution  is  therefore  urged,  before  a  conclusion  is  drawn  that  any  selection  meas¬ 
ure  is  fair  because  it  validly  predicts  inferiority  for  a  given  subgroup.  The 
criterion  problem  is  one  that  has  always  plagued  all  areas  of  personnel  research. 
The  current  research  is  (unfortunately)  not  immune  from  this  plague. 

The  General  Validity  Picture 

The  sad  state  of  the  validity  of  psychological  tests  for  the  prediction  of 
Job  performance  was  well  summarized  by  Ghiselli  (1955)  more  than  fifteen  years 
ago.  Reported  validities  of  tests  were  found  to  rarely  exceed  .90  and  more  likely 
to  be  in  the  .30  -  .40  range.  Unreported  validities  or  in  situations  where  no 
validation  has  been  carried  out  may  not  present  even  this  optimistic  a  picture. 

Combining  both  phases  of  the  present  study,  more  than  1000  validity  compari¬ 
sons  have  been  examined  vlth  particular  emphasis  on  breaking  these  down  into 
ethnic  subgroups.  By  classifying  ths  relationships  found  according  to  the  Bart let t- 
0' Leary  differential  prediction  models,  it  vas  hoped  to  demonstrate  the  frequency 
for  which  a  measure  might  be  inappropriate  for  combined  group  prediction.  A 
Bartlett -O' Leary  model  was  illustrated  by  store  than  40%  of  the  situations  examined. 
These  cases  Indicated  the  inapproprlatness  of  using  a  regression  line  developed 

on  the  combined  group.  One  might  erroneously  conclude  on  the  basis  of  this  sum- 

* 

mary  that  prediction  on  the  basis  of  combined  group  analysis  would  thus  be  appro¬ 
priate  on  the  more  than  90%  where  a  Bart lett -0 ' Leary  model  vas  not  found. 

Bartlett  and  O'Leary  did  not  discuss  two  situations,  however,  because  they 
should  be  obvious,  but  nevertheless  attention  is  drawn  to  them  here.  The  first 
might  be  called  Model  0  (as  in  Zero),  where  the  measure  has  no  validity  in  the 
combined  group  or  either  subgroup.  Model  0  vas  found  in  almost  half  of  all  the 
validity  comparisons  asde.  Although  the  use  of  the  test  for  selection  will  not 
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necessarily  result  In  unfair  disc rial nation  in  this  situation  a  table  of  random 
nuebers  will  serve  the  save  purpose  sore  economically,  and  will  further  assure 
no  unfair  discrimination.  The  other  model  is  one  that  is  equally  as  obvious, 
tout  unfortunately  does  not  occur  often.  This  might  be  called  Model  V  (as  in 
valid),  where  the  measure  is  equally  valid  for  both  groups  and  where  there  are 
no  differences  on  either  the  predictor  or  criterion.  Model  V,  where  it  is  appro* 
priate  to  use  the  predictor  for  selection  on  both  subgroups  combined,  was  found 
in  fever  than  10%  of  the  situations  analyzed.  Most  of  the  validities  determined 
appropriate  were  found  for  one  job  (merchandise  handler)  where  multiple  measures 
for  both  predictors  and  criterion  led  to  a  large  number  of  comparisons.  If 
this  particular  job  situation  had  not  been  included.  Model  V  would  have  occurred 
in  fever  than  2%  of  the  comparisons  made. 

The  large  number  of  situations  found  vhere  the  predictors  were  completely 
inappropriate  and  the  very  few  found  where  the  predictors  were  appropriate  strongly 
support  the  need  for  situation  specific  test  validation  for  all  situations. 

The  Search  for  the  Unbiased  Predictor 

The  goal  of  this  phase  of  the  project  has  been  stated  to  be  to  examine  a 
wider  variety  of  predictors  in  hopes  of  determining  a  way  of  eliminating  the  test 
bias  found  prevelant  in  the  more  traditional  measures  studied  in  Phase  I  (O'Leary, 
Farr,  and  Bartlett,  1970).  Table  57  reports  the  frequency  of  occurrence  of  the 
Bartlett -O' Leary  models  from  Phase  I.  On  the  basis  of  these  765  comparisons  with 
these  traditional  predictors  it  was  concluded  that  test  bias  is  clearly  present 
in  a  large  nunber  of  cases  where  heterogeneous  groups  are  combined  in  making 
predictions.  Unless  other  kinds  of  measures  can  be  developed  which  do  not  lead 
to  such  bias,  differential  test  validation  must  be  carried  out  on  identifiable 
subgroups  of  the  population.  The  traditional  model  which  assumes  homogeneous 
groups  i-  clearly  inappropriate. 
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This  second  phase  of  the  study  has  investigated  less  traditional  predictors 
In  the  hopes  of  finding  procedures  which  can  be  used  to  reduce  test  bias. 

Table  58  presents  a  frequency  tabulation  of  Bart Lett -O' Leary  Model  occurrence  with 
the  less  traditional  predictors.  Additional  information  was  also  obtained  on 
traditional  selection  measures  and  frequency  of  occurrence  of  the  Bart lett -0 ' Leary 
Models  is  included  in  Table  56. 

As  can  be  seen  the  use  of  less  traditional  predictors  does  not  reduce  the 
frequency  of  occurrence  of  test  bias.  Test  bias  appears  to  occur  at  approximately 
the  same  frequency  regardless  of  the  nature  of  the  predictor. 

Studies  #1  and  #7  employed  work  sample  tests  in  the  form  of  a  simulation  of 
the  Job  along  with  tests  of  motor  skills.  The  result  was  a  disappointing  lack  of 
validity  for  most  measures  employed. 

Studies  #5,  and  #6  took  a  new  look  at  the  formerly  disappointing  approach 
of  the  culture-free  test  by  examining  difference  scores  between  culture -loaded 
tests  and  the  culture-free  tests  as  an  index  of  cultural  deprivation.  This  tech¬ 
nique  which  had  been  suggested  by  Guion  (1966)  proved  equally  disappointing. 

Study  #<?  and  #3  examined  biographical  information  as  a  predictor  of  academic 
performance  for  high  school  and  college  students,  respectively.  Although  bio¬ 
graphical  information  shows  some  promise  as  a  predictor  for  the  high  school 
students,  the  problem  of  differential  performance  on  the  criterion  would  lead  to 
unfair  bias  in  the  measure  without  differential  prediction.  With  differential 
prediction  the  results  would  be  similar  to  those  produced  by  the  traditional 
ability  measures. 

Study  #8  provided  the  widest  disparity  from  traditional  predictor-criterion 
performance  studies,  by  studying  learning  measures  as  predictors  as  well  as 
criteria  in  a  laboratory  setting.  The  use  of  traditional  ability  measures  on 
subgroups  from  culturally  deprived  backgrounds  has  been  criticized  for  the  dif¬ 
ferential  learning  of  the  concepts  required  of  these  tests  which  may  have  taken 
place  as  a  result  of  the  cultural  deprivation.  Thus  by  examining  learning  measures 
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in  the  laboratory  rather  than  measures  of  previously  learned  Material  (i.e. , 
traditional  tests).  It  was  hoped  to  reduce  the  bias  for  Minority  groups.  The 
Measures  fron  the  simple  learning  tasks  showed  some  promise  of  validity  in  pre¬ 
dicting  learning  of  complex  Material,  and  in  some  cases  eliminated  differential 
predictor  performance  between  the  white  and  Negro  groups.  The  learning  Measures, 
at  least  with  some  criteria,  showed  superior  validity  for  the  Negro  group.  Con¬ 
sidering  the  finding  that  traditional  tests  usually  show  higher  validities  for 
the  white  group,  this  is  encouraging.  However  the  superior  performance  on  the 
criterion  by  the  white  group,  combined  with  the  finding  that  the  traditional  in¬ 
telligence  measure  predicted  more  criteria  than  any  other  predictor,  leads  to 
the  conclusion  that  learning  measures  may  provide  only  a  slight  glimmer  of  hope 
as  unbiased  predictors  rather  than  providing  a  final  solution  to  the  problem.  An 
attempt  to  extend  this  study  to  validate  miniature  learning  measures  as  predictors 
of  training  criteria  would  appear  warranted. 

Ponce  de  Leon  searched  for  the  Fountain  of  Touth,  Lancelot  for  the  Holy 
Grail.  The  "Unbiased  Predictor"  has  not  been  found  either. 
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APPENDIX  A  -  CRITERION  TEST 


MEASUREMENT  TEST 

Blacken  in  the  correct  ansver  on  the  Answer  Sheet  provided.  DO  NOT  write 
in  the  test  booklet. 


1.  A  measure  of  central  tendency  which  depends  on  every  score  in  the 
distribution  is  the: 

(a)  mean 

(b)  median 

(c)  mode 

(d)  all  of  the  above 

2.  What  is  the  median  of  the  following  set  of  scores? 

27,  28,  29,  30,  31,  31,  31 

(a)  30 

(b)  31 

(c)  28.1 

(d)  27.5 


3-  What  percentile  rank  corresponds  to  a  score  one  standard  deviation 
above  the  mean? 


(a)  16 

(b)  34 

(c)  50 

(d)  84 


4.  John  has  computed  the  dispersion  of  a  first  grade  class  on  their  final 
spelling  test.  Unknowingly,  John  has  included  the  teacher's  test  score 
in  the  measure.  If  John  eliminates  the  teacher's  test  score,  which  one 
of  the  following  measures  will  not  be  affected? 

(a)  standard  deviation 

(b)  inclusive  range 

(c)  interquartile  range 

(d)  percentile  rank 
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5*  One  extreme  score  can  effect  which  of  the  follow it^  Measure  s? 

(a)  mean 

(b)  inclusive  range 

(c)  both  a  and  b 

(d)  none  of  the  above 

6.  Assume  that  IQ  scores  are  normally  distributed  with  a  mean  of  100 

a  standard  deviation  of  15*  What  proportion  of  people  have  scores  above 
115-? 

(a)  16 

(b)  3^ 

(c)  50 

(d)  84 

7.  In  a  normal  distribution: 

(a)  at  least  75%  of  the  cases  lie  on  one  side  of  the  mean 

(b)  68%  of  the  cases  lie  between  plus  and  minus  one  standard 
deviation  from  the  mean 


(c)  the  median  must  fall  either  above  or  below  the  mean 

(d)  all  scores  lie  within  one  standard  deviation  from  the  mean 


distribution? 


ing 

frequency 

(a) 

11 

(b) 

14 

(c) 

16,  17,  18 

(d) 

20 
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9*  Which  of  the  following  measures  is  defined  as  the  distance  from  the 
highest  to  lowest  score  plus  one: 

(a)  standard  deviation 

(b)  inclusive  range 

(c)  quartile 

(d)  semi-quart ile  range 

10.  Which  of  the  following  is  a  measure  of  the  dispersion  of  a  set  of  scores? 

(a)  range 

(b)  standard  deviation 

(c)  interquartile  range 

(d)  all  of  the  above 

11.  If  a  distribution  is  normal,  the  approximate  per  cent  of  cases  falling 
within  two  standard  deviations  from  the  mean  in  either  direction  is: 

(a)  100 

(b)  92 

(c)  68 

(d)  50 

12.  Find  the  inclusive  range  of  the  following  set  of  scores. 

100,000  ;  50,000  ;  10,000  ;  1,000 

(a)  99,001 

(b)  40,000 

(c)  20,000 

(d)  none  of  the  toove 


13 .  If  60  per  cent  of  the  students  score  less  than  2k  points  on  a  test,  a 

score  of  2k  has  a _  of  60. 

(a)  percentile 

(b)  percentile  rank 

(c)  quart ile 

(d)  none  of  the  above 

14.  When  a  score  in  the  distribution  is  changed,  the  _ will 

always  change. 

(a)  median 

(b)  mean 

(c)  mode  and  median 

(d)  mode,  median,  and  mean 

15.  A  bowling  score  which  occurs  most  often  in  the  course  of  a  season  of 

bowling  is  the  bowler's  _ . 

(a)  median 

(b)  mode 

(c)  mean 

(d)  either  mean  or  median 
lb.  The  median  is  the: 

(a)  midpoint  between  the  largest  and  smallest  score 

(b)  midpoint  of  the  score  class  that  contains  50$  of  the  cases 

(c)  score  point  that  has  as  many  cases  below  it  as  above  it 

(d)  average  of  the  larger  and  smaller  scores 
17.  In  a  normal  distribution  the: 

(a)  mean  ^  mode  ^  median 

(b)  mean  =  mode  =  median 

(c)  mean  ^  median  ^  mode 

(d)  mode  ^  median  ^  mean 
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18.  A  devintio:.  score  is  a  measure  of  how  much  a  single  score  deviates  from 
th* _ _  of  a  group. 

(a)  mean 

(b)  median 

(c)  mode 

(d)  none  of  the  above 

19*  If  a  score  of  20  has  a  percentile  rank  of  40,  it  means  that  per  cent 

of  the  students  scored  fewer  than  _  points  on  the  test! 

(a)  20,  40 

(b)  40,  20 

(c)  60,  80 

(d)  can  not  be  determined  from  the  above  information 

20.  The  score  value  at  the  most  frequently  occurring  score  in  the  distribution 

may  be  the  _ ,  but  must  be  the  _  . 

(a)  mode,  median 

(b)  mode  and  mean,  median 

(c)  mode,  mean  and  median 

(d)  mean  and  median,  mode 

21.  Joe’s  percentile  rank  was  as  follows  on  4  tests  taken  at  the  beginning 
of  school. 

Hath  60  Georgraphy  50 


English  65 

Joe  was  above  the  mean  in: 


History 


(a)  Math,  English,  and  Georgraphy 

(b)  Math,  English  and  History 

(c)  all  of  the  above 

(d)  not  enough  information  to  determine 


22.  The  measure  which  is  based  on  only  two  scores  in  the  distribution  is: 


(a)  inclusive  range 

(b)  standard  deviation 

(c)  coefficient  of  correlation 

(d)  median 

23.  Compute  the  interquartile  range  of  the  following  set  of  scores 

Score 
13 
12 
11 
10 
9 
b 
7 

(a)  3 

(b)  6 

(c)  50 

(d)  64 

24.  The  mode  is: 

(a)  a  point  on  the  score  scale  below  which  lie  one-half  the 
scores 

(b)  the  largest  score  in  a  set  of  data 

(c)  the  sum  of  the  scores  d  vided  by  the  number  of  correct 
answers 

(d)  the  score  with  the  largest  frequency 

25.  Given  the  following  set  of  scores,  what  is  the  percentile  rank  of  a 
score  of  30? 

10  ;  20  ;  30  ;  40  ;  bO 

(a)  10 

(b)  20 

(c)  40 

(d)  60 


Percentile  Rank 
- S9 - 

75 

60 

42 

25 

13 

5 
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2b.  A  teacher  decides  to  fail  16*  of  her  class.  Pinal  class  grades  are 

roughly  normally  distributed  with  a  mean  of  72  and  a  standard  deviation 
of  6.  What  mark  must  a  student  make  to  pass  the  course? 

(a)  60 

(b)  66 

(c)  68 

(d)  72 

27-  The  third  quartile  is  equal  to  the: 

(a)  25  th  percentile 

(b)  50  th  percentile 

(c)  75  th  percentile 

(d)  none  of  the  above 

26.  If  several  mathematical  wizards  (  college  professors)  are  Included  in 
a  high  school  mathematics  class,  their  scores  will  affect  which  measure 
of  central  tendency  the  most. 

(a)  mean 

(b)  median 

(c)  mode 

(d)  depends  on  the  number  of  students  in  the  class 

29-  The  median  is  equal  to  the 

(a)  25*  tile 

( b )  50*  tile 

( c )  7 5*  t i le 

(d)  none  of  the  above 

30.  What  is  the  mean  of  the  following  set  of  data? 

3»  3,  b ,  o »  ( 1  1,  o,  9 

(a)  3-5 

(b)  7 

(c)  6  and  7 

(d)  5-1* 

lo7 


APPENDIX  B  -  PRINCIPLE  LEARNING  TEST 


Name  _ 

TEST  OF  YOUR  GENERAL  KNOWLEDGE  OF  MEASURES'® NT 


For  each  kind  of  measurement  listed  belov  decide  whether  the  scale  has 
the  characteristics  most  like  a  NOMINAL  or  ORDINAL  scale.  Place  an  N  beside 
those  which  are  NOMINAL  and  an  0  beside  those  which  are  ORDINAL. 


1. 

2. 

3- 

4. 

5- 

6. 

7. 

8. 
9. 

10. 

11. 

12. 

13- 

14. 

15. 

16. 

17. 

18. 
19- 
20. 
21. 
22. 
23- 

24. 

25. 

26. 

27. 

28. 

29. 

30. 

31. 

32. 
33- 
34. 
35- 

36. 

37. 

38. 

39. 

40. 


Postal  zones  (zip  codes) 

Dewey  Decimal  Classification  in  library 
Sex 

Win,  place,  show  at  horse  race 
Telephone  numbers 

Starting  position  in  automobile  race 
Goodness  of  handwriting 

Political  affiliation  (Republican,  Democrat,  etc.) 

Number  of  years  of  education 

Amount  of  weight  lost  on  diet 

Medical  diagnosis  (Cancer,  T.  B.,  Polio,  etc.) 

Major  league  baseball  standings 
Class  standing  at  graduation 
Grades  in  school 
Race  of  employee 
License  plate  members 
Socioeconomic  status 
Religious  denomination 

Major  in  college  (Engineering,  Liberal  Arts,  Education,  etc.) 

Route  numbers  on  highway  signs 

Girls  ranked  in  terms  of  beauty 

Marital  status 

Mean  shoe  size 

Age  in  years 

Ratings  of  job  performance 

Make  of  car  (Ford,  Plymouth,  Chevrolet,  etc.) 

Congressional  district  numbers 

Library  of  Congress  Classification  system  in  library 
Social  security  numbers 
Preference  for  types  of  ice  cream 
County  of  birth 

Income  level  of  college  professors 

Rank  in  military  service  (Major,  Captain,  Private,  etc.) 
Growth  in  inches  from  ages  10  to  15 
Intelligence  test  scores 

Psychiatric  diagnosis  (ochozphrenic,  Paranoid,  Neurotic,  etc.) 
Numbers  on  football  players'  jersey 

Subjective  pitch  (apparent  nighness  or  lowness  of  tone) 
Texonomy  in  biology  (phyla,  genera,  species,  etc.) 

Attitude  toward  Vietnam  war  (Favorable,  Neutral,  Unfavorable) 
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